Unit C4 
Eigenvectors 


1 Eigenvalues and eigenvectors 


Introduction 


By now you should be familiar with a wide variety of linear 
transformations from one vector space to another, and should appreciate 
that the matrix of a linear transformation depends on the bases chosen for 
the domain and codomain. In this final unit on linear algebra we 
concentrate on linear transformations from $\mathbb{R}^2$ to $\mathbb{R}^2$, from $\mathbb{R}^3$ to $\mathbb{R}^3$ and,
more generally, from $\mathbb{R}^n$ to $\mathbb{R}^n$, and address the following question.


Is it possible to find a basis for both the domain and codomain so 
that the matrix of a linear transformation is a diagonal matrix? 


In the preceding units of this book you have studied vectors, matrices, 
vector spaces and linear transformations. The method for finding a 
diagonal matrix of a linear transformation (if such a matrix exists) links all 
these topics together. To round off the linear algebra topic, we use linear 
transformations and diagonal matrices to classify conics and quadrics. 


1 Eigenvalues and eigenvectors 


In this section you will see that some lines through the origin are mapped
to themselves by some linear transformations from $\mathbb{R}^2$ to $\mathbb{R}^2$: the
individual points on these lines are usually moved, but, for a given line, all
the points are scaled by a constant factor. You will see that this idea of
fixed lines also applies to linear transformations from $\mathbb{R}^3$ to $\mathbb{R}^3$ and, more
generally, from $\mathbb{R}^n$ to $\mathbb{R}^n$. You will learn how determinants can be used for
finding these fixed lines of linear transformations.


1.1 What is an eigenvector? 


In Subsection 1.1 of Unit C3 Linear transformations you saw that a linear 
transformation $t : \mathbb{R}^2 \to \mathbb{R}^2$ moves the points of the plane around, but
fixes the origin. Furthermore, parallel lines get mapped to parallel lines. In 
this section we will observe that t may map some lines through the origin 
onto themselves. These ‘unchanged’ lines are rather special. 


Consider the linear transformation $t : \mathbb{R}^2 \to \mathbb{R}^2$ given by

$$t(x, y) = (x + 4y, x - 2y).$$


We know that t maps the origin (0,0) to itself, since this is a property of 
all linear transformations. 


We can calculate the image of the point (1,0): 
$$t(1, 0) = (1 + (4 \times 0), 1 - (2 \times 0)) = (1, 1).$$




Since linear transformations map lines through the origin to lines through 
the origin, t maps the line joining the points (0,0) and (1,0) to the line 
joining the points (0,0) and (1,1), as illustrated in Figure 1; that is, 


$t$ maps the line $y = 0$ to the line $y = x$.




Figure 1 The image of the line y = 0 under the linear transformation t 


Let us now calculate the image of the point $(1, -1)$:
$$t(1, -1) = (1 + 4(-1), 1 - 2(-1)) = (-3, 3).$$


In this case, the linear transformation $t$ maps the line joining the points
$(0, 0)$ and $(1, -1)$ to the line joining the points $(0, 0)$ and $(-3, 3)$, as
illustrated in Figure 2; that is,


$t$ maps the line $y = -x$ to itself.


Although t moves individual points on the line (except (0,0)) to other 
points, the line as a whole is unchanged. 



Figure 2 The image of the line $y = -x$ under the linear transformation $t$


The image of the point $(1, -1)$ under $t$ is the point $(-3, 3) = -3(1, -1)$.
The vector $(1, -1)$ is scaled (stretched) by a factor of $-3$; that is, the
resulting vector has three times the magnitude of the original and points in the
opposite direction. In the next exercise you will investigate how other
vectors lying along the line $y = -x$ are moved by $t$.


Exercise C115 


For the above linear transformation $t$, calculate the images of the vectors
$(2, -2)$ and $(-7, 7)$. What do you notice?




We have seen that the linear transformation $t$ scales some vectors lying
along the line $y = -x$ by the factor $-3$. In fact this is true of any vector
lying along this line, as we now show.


Let $k$ be any real number, so that $(k, -k) = k(1, -1)$ is a vector lying
along the line $y = -x$. Then


$$t(k, -k) = (k - 4k, k + 2k) = (-3k, 3k) = -3(k, -k),$$


which shows that $t$ has the same scaling effect on each vector $(k, -k)$ lying
along the line $y = -x$.
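
The scaling along the line $y = -x$ can also be spot-checked numerically. The following is a minimal sketch in Python; the function name `t` and the sample values of `k` are choices made here for illustration and are not part of the unit.

```python
def t(x, y):
    """The linear transformation t(x, y) = (x + 4y, x - 2y)."""
    return (x + 4 * y, x - 2 * y)


# Sample values of k (an arbitrary choice made for this illustration).
for k in (1, 2, -7, 10):
    image = t(k, -k)
    # The image should be -3 times the original vector (k, -k).
    assert image == (-3 * k, 3 * k)
    print((k, -k), "is mapped to", image)
```

A finite check of this kind only illustrates the result; it is the algebraic argument that covers every real number $k$.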


Does the linear transformation t map other lines through the origin to 
themselves? 


Exercise C116 


(a) For the above linear transformation t, calculate t(0, 1), t(1,2) and 
t(4,1). 

(b) Use one of the solutions to part (a) to write down another line in $\mathbb{R}^2$
that is mapped to itself by the linear transformation t. 


(c) Find t(4k, k). 


We have seen that the linear transformation $t$ maps each of the lines
$y = -x$ and $x = 4y$ to itself. In both cases, each vector along the line is
moved to a scalar multiple of itself: each vector lying along the line $y = -x$
is mapped to $-3$ times itself and each vector lying along the line $x = 4y$ is
mapped to 2 times itself. We call the non-zero vectors lying along the line
$y = -x$ eigenvectors of $t$ with corresponding eigenvalue $-3$; for example,
$(1, -1)$ and $(-7, 7)$ are eigenvectors of $t$ with corresponding eigenvalue $-3$.
Similarly, we call the non-zero vectors lying along the line $x = 4y$
eigenvectors of $t$ with corresponding eigenvalue 2; for example, $(4, 1)$ and
$(-8, -2)$ are eigenvectors of $t$ with corresponding eigenvalue 2.


More generally, we make the following definitions; here and throughout 
this unit we use V to denote a finite-dimensional vector space. 


Definitions 


Let $t : V \to V$ be a linear transformation. An eigenvector of $t$ is a
non-zero vector $\mathbf{v}$ that is mapped by $t$ to a scalar multiple of itself;
this scalar is the corresponding eigenvalue.


In symbols, a non-zero vector $\mathbf{v}$ is an eigenvector of a linear
transformation $t$ if


$$t(\mathbf{v}) = \lambda \mathbf{v}, \quad \text{for some } \lambda \in \mathbb{R};$$


$\lambda$ is the corresponding eigenvalue.
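
The definition translates directly into a test that can be run on particular vectors. The sketch below is illustrative only: the helper name `is_eigenvector` is an assumption made here, and the transformation `t` is the example $t(x, y) = (x + 4y, x - 2y)$ used earlier in this subsection.

```python
def t(x, y):
    """The example transformation t(x, y) = (x + 4y, x - 2y)."""
    return (x + 4 * y, x - 2 * y)


def is_eigenvector(transformation, v, lam):
    """Return True if v is non-zero and transformation(v) equals lam * v."""
    if all(component == 0 for component in v):
        return False  # the zero vector is excluded by definition
    image = tuple(transformation(*v))
    return image == tuple(lam * component for component in v)


print(is_eigenvector(t, (1, -1), -3))  # True
print(is_eigenvector(t, (4, 1), 2))    # True
print(is_eigenvector(t, (1, 0), 1))    # False: t(1, 0) = (1, 1)
print(is_eigenvector(t, (0, 0), 2))    # False: zero vector excluded
```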




We exclude the case $\mathbf{v} = \mathbf{0}$, since $t(\mathbf{0}) = \mathbf{0}$ for every linear
transformation $t$. It is, however, possible for $\lambda$ to be 0: when $\lambda = 0$, the
linear transformation maps every vector corresponding to this eigenvalue
to the origin; you will see an instance of this in Exercise C120.


Eigen is a German word meaning own, characteristic or special. 
Another name for eigenvalue is characteristic value. 


The eigen terms are associated with the German mathematician 
David Hilbert (1862-1943) who first used the terms Eigenfunktion 
(eigenfunction) and Eigenwert (eigenvalue) in a series of papers on
integral equations (1904-1910). It is possible that Hilbert was 
following the German physicist Hermann von Helmholtz (1821-1894) 
who used the term Eigentöne in acoustics in the nineteenth century. 


In the 1920s the use of the eigen terminology was promoted through 
the development of the matrix mechanics formulation of quantum 
theory by the German physicist Werner Heisenberg (1901-1976) who 
wrote the new theory in the language of Hilbert and his followers. 


In the example above we found two lines that are mapped to themselves 
by t, by considering the images of various points. This is a rather 
hit-and-miss way of finding eigenvalues and eigenvectors. Before 
developing a general method for finding them, we see that it is sometimes 
possible to do so by considering the geometry of the transformation. 


Worked Exercise C62 


Let $t : \mathbb{R}^2 \to \mathbb{R}^2$ be the linear transformation that maps each point to its
reflection in the x-axis. By considering the geometric features of t, 
determine as many eigenvectors of t as you can and write down the 
corresponding eigenvalue in each case. 


Solution 
Reflection in the $x$-axis maps each point $(x, y)$ to the point $(x, -y)$.
A sketch can help.




Exercise C117 


By considering the geometric features of each of the following linear 
transformations of the plane, determine as many eigenvectors as you can 
and write down the corresponding eigenvalue in each case: 


(a) reflection in the line y = x 
(b) 2-dilation 
(c) anticlockwise rotation through $\pi/2$ about the origin


(d) anticlockwise rotation through $\pi$ about the origin.


In Exercise C117 it is possible to spot the eigenvectors geometrically. We 
now illustrate a general method to determine the eigenvalues and 
eigenvectors for any given transformation. 


Consider again the linear transformation $t : \mathbb{R}^2 \to \mathbb{R}^2$ given by
$$t(x, y) = (x + 4y, x - 2y).$$


We wish to find those vectors $(x, y)$ that are mapped to scalar multiples of
themselves; that is,


$$t(x, y) = \lambda(x, y) = (\lambda x, \lambda y).$$
We equate the expressions for $t(x, y)$ and obtain


$$(x + 4y, x - 2y) = (\lambda x, \lambda y).$$




Equating the first and second coordinates of these vectors, we obtain the 
system of linear equations 


$$\begin{aligned} x + 4y &= \lambda x \\ x - 2y &= \lambda y. \end{aligned}$$


This is a system of two equations in the three unknowns $x$, $y$ and $\lambda$. One
way of solving this system is to move the terms on the right to the
left-hand side. Thus we obtain the system
$$\begin{aligned} (1 - \lambda)x + 4y &= 0 \\ x + (-2 - \lambda)y &= 0. \end{aligned} \tag{1}$$


Equations (1) are called the eigenvector equations. We use them to find
the possible values of $\lambda$, and then to find all the eigenvectors that
correspond to these values. They are homogeneous equations in $x$ and $y$
since the constant terms are all zero.


Systems of homogeneous linear equations always have the trivial solution, 
in this case x = 0, y = 0, but this corresponds to the zero vector, which is 
excluded. Thus we seek non-zero solutions to the pair of homogeneous 
equations (1). Since we have two equations in three unknowns, such a 
system is bound to be dependent; that is, the homogeneous system has
insufficient constraints on the unknowns to determine them uniquely. 


From Theorem C19, Summary Theorem, in Unit C1 Linear equations and 
matrices we know that a homogeneous system has only the trivial solution 
if and only if the determinant of the coefficient matrix is non-zero. The 
contrapositive of this tells us that non-zero solutions exist if and only if the 
determinant of the coefficient matrix is 0; that is, if and only if 


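$$\begin{vmatrix} 1 - \lambda & 4 \\ 1 & -2 - \lambda \end{vmatrix} = 0.$$


We expand the determinant and obtain


$$(1 - \lambda)(-2 - \lambda) - 4 = 0,$$


which simplifies (after some algebra) to
$$\lambda^2 + \lambda - 6 = 0.$$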


This equation is called the characteristic equation of t, and its solutions 
are the eigenvalues we seek. Notice that the characteristic equation, 
whether or not it is written in terms of a determinant, is a polynomial 
equation in $\lambda$ whose degree is the dimension of the domain of $t$ (in this
case 2). Here, we have


$$(\lambda - 2)(\lambda + 3) = 0,$$
so the eigenvalues are $\lambda = 2$ and $\lambda = -3$.


To find the corresponding eigenvectors, we consider each eigenvalue $\lambda$ in
turn.




Putting $\lambda = 2$ into the eigenvector equations (1), we obtain


$$\begin{aligned} -x + 4y &= 0 \\ x - 4y &= 0. \end{aligned}$$


One equation is $-1$ times the other, so the equations are equivalent
to the single equation


$$x = 4y.$$


Thus the eigenvectors corresponding to $\lambda = 2$ are the non-zero
vectors $(x, y)$ for which $x = 4y$; that is, the vectors of the form


$$(4k, k), \quad \text{where } k \neq 0.$$


Since we are working in a real vector space, in this case $\mathbb{R}^2$, when we are
talking about eigenvectors, $k$ represents a real number.


Putting $\lambda = -3$ into the eigenvector equations (1), we obtain


$$\begin{aligned} 4x + 4y &= 0 \\ x + y &= 0. \end{aligned}$$


These equations are equivalent to the single equation


$$y = -x.$$
Thus the eigenvectors corresponding to $\lambda = -3$ are the non-zero
vectors $(x, y)$ for which $y = -x$; that is, the vectors of the form


$$(k, -k), \quad \text{where } k \neq 0.$$
Thus the eigenvectors of $t$ are the non-zero vectors of the following forms:
$(4k, k)$, corresponding to $\lambda = 2$,
$(k, -k)$, corresponding to $\lambda = -3$.


This method produces all the eigenvalues and eigenvectors of the linear 
transformation. On the other hand, trying to show that these are the only 
ones by calculating the images of various points, as we started to do at the 
beginning of the section, would take forever! 


Exercise C118 


Let $t : \mathbb{R}^2 \to \mathbb{R}^2$ be the linear transformation given by
(a) Find the eigenvector equations of t. 


(b) Find the characteristic equation of t, and solve it to find the 
eigenvalues of t. 


(c) Solve the eigenvector equations, for each eigenvalue in turn, to find 
the eigenvectors of t. 




1.2 Finding eigenvalues and eigenvectors 


You have just seen how to find the eigenvalues and eigenvectors of a given 
linear transformation $t : \mathbb{R}^2 \to \mathbb{R}^2$. This method, as it stands, is rather
tedious to use to find eigenvalues and eigenvectors of linear
transformations from $\mathbb{R}^3$ to $\mathbb{R}^3$, or $\mathbb{R}^4$ to $\mathbb{R}^4$, and so on. However, by
introducing matrices, we can simplify the method. 


We now work through the same example as in the previous subsection, but 
this time we use matrices. 


Theorem C40 of Unit C3 tells us that there is a unique matrix for t with 
respect to the standard (ordered) basis in both the domain and codomain, 
and we use Strategy C15 from that unit to find this matrix. Recall that 
this strategy tells us essentially to ‘read off’ the matrix of a linear 
transformation when we are using the standard bases. We have 

$t(1, 0) = (1, 1)$ and $t(0, 1) = (4, -2)$, so these vectors are the columns of the
matrix of the linear transformation, since we are using the standard bases. 
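
Reading off the matrix from the images of the standard basis vectors can be sketched in a few lines of Python. The names `standard_basis`, `columns` and `mat_vec` are introduced here purely for illustration.

```python
def t(x, y):
    """The example transformation t(x, y) = (x + 4y, x - 2y)."""
    return (x + 4 * y, x - 2 * y)


standard_basis = [(1, 0), (0, 1)]

# The images of the standard basis vectors are the COLUMNS of the matrix.
columns = [t(*e) for e in standard_basis]

# Transpose the list of columns to obtain the rows of the matrix.
A = [list(row) for row in zip(*columns)]
print(A)  # [[1, 4], [1, -2]]


def mat_vec(M, v):
    """Multiply the matrix M (a list of rows) by the vector v."""
    return tuple(sum(m * x for m, x in zip(row, v)) for row in M)


# Multiplying by A agrees with applying t (checked on one sample vector).
assert mat_vec(A, (3, 5)) == t(3, 5)
```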


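Therefore, with respect to the standard basis for $\mathbb{R}^2$, the linear
transformation $t$ given by $t(x, y) = (x + 4y, x - 2y)$ has the matrix
representation


$$t : \mathbf{v} \mapsto \mathbf{A}\mathbf{v}, \quad \text{where } \mathbf{v} = \begin{pmatrix} x \\ y \end{pmatrix} \text{ and } \mathbf{A} = \begin{pmatrix} 1 & 4 \\ 1 & -2 \end{pmatrix}.$$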


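If $\mathbf{v}$ is an eigenvector of $t$ with corresponding eigenvalue $\lambda$, then
$$t(\mathbf{v}) = \lambda \mathbf{v};$$


in matrix form, this becomes


$$\begin{pmatrix} 1 & 4 \\ 1 & -2 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} = \lambda \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} \lambda x \\ \lambda y \end{pmatrix}. \tag{2}$$


Using the $2 \times 2$ identity matrix $\mathbf{I}$, we can write


$$\begin{pmatrix} \lambda x \\ \lambda y \end{pmatrix} = \lambda \mathbf{I} \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} \lambda & 0 \\ 0 & \lambda \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix},$$


so equation (2) can be written as


$$\begin{pmatrix} 1 & 4 \\ 1 & -2 \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} \lambda & 0 \\ 0 & \lambda \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix}.$$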




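We simplify this matrix equation and obtain


$$\begin{pmatrix} 1 - \lambda & 4 \\ 1 & -2 - \lambda \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \end{pmatrix}.$$


This gives rise to the eigenvector equations


$$\begin{aligned} (1 - \lambda)x + 4y &= 0 \\ x + (-2 - \lambda)y &= 0, \end{aligned}$$


as before, which we labelled equations (1). The characteristic equation is


$$\begin{vmatrix} 1 - \lambda & 4 \\ 1 & -2 - \lambda \end{vmatrix} = 0;$$
that is,
$$\det(\mathbf{A} - \lambda \mathbf{I}) = 0.$$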


We can therefore find the characteristic equation directly from the matrix 
of the linear transformation (with respect to the standard basis for both 
the domain and codomain) by subtracting $\lambda$ from each diagonal entry and
then equating the determinant to zero.
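
As an illustration of this recipe, the sketch below evaluates $\det(\mathbf{A} - \lambda \mathbf{I})$ for the $2 \times 2$ matrix of the running example at a range of trial values of $\lambda$. The function name `char_det` and the range of trial values are assumptions made for this example; the determinant vanishes exactly at the eigenvalues.

```python
def char_det(A, lam):
    """det(A - lam*I) for a 2 x 2 matrix A given as ((a, b), (c, d))."""
    (a, b), (c, d) = A
    # Subtract lam from each diagonal entry, then take the determinant.
    return (a - lam) * (d - lam) - b * c


A = ((1, 4), (1, -2))

# Tabulate the determinant at some integer trial values of lam.
for lam in range(-4, 4):
    print(lam, char_det(A, lam))

# The values where the determinant is zero are the eigenvalues.
roots = [lam for lam in range(-4, 4) if char_det(A, lam) == 0]
print(roots)  # [-3, 2]
```

Scanning integer values works here only because the eigenvalues happen to be integers; in general the polynomial equation has to be solved.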


Once we have found the eigenvalues, we use the same method as before to 
find the eigenvectors; that is, we substitute each eigenvalue in turn into the 
eigenvector equations and solve them. 


In view of this connection with matrices, we adopt the following definitions. 


Definitions 


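A non-zero vector $\mathbf{v}$ is an eigenvector of a square matrix $\mathbf{A}$ if
$$\mathbf{A}\mathbf{v} = \lambda \mathbf{v}, \quad \text{for some } \lambda \in \mathbb{R};$$


$\lambda$ is the corresponding eigenvalue.


The characteristic equation of a square matrix $\mathbf{A}$ is the equation


$$\det(\mathbf{A} - \lambda \mathbf{I}) = 0.$$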


In this way we can refer to eigenvectors, eigenvalues and the characteristic 
equation of a matrix even when a linear transformation is not explicitly 
involved. 


The matrix $\mathbf{A} - \lambda \mathbf{I}$ is obtained by subtracting $\lambda$ from each entry on the
diagonal of $\mathbf{A}$.




Eigenvalues and eigenvectors of matrices occur naturally in many 
applications — for example, in the study of vibrating mechanical 
systems. In such examples, the characteristic equation may have 
solutions that are not real numbers, and these complex eigenvalues 
have significance in these applications. In this unit we are primarily 
interested in linear transformations of the plane and of 
three-dimensional space, so complex eigenvalues play no role here: we 
are concerned only with real eigenvalues and eigenvectors. 


Other areas of application include music, bridge design, oil 
exploration, image compression, and analysis of financial data. A 
particular example is the use of eigenvectors in the PageRank 
algorithm. This algorithm was invented by Larry Page and
Sergey Brin, the founders of Google, in 1996 for use by the Google
search engine to rank the importance of web pages. According to 
Google, PageRank works by counting the number and quality of links 
to a page to determine a rough estimate of how important the website 
is. The underlying assumption is that more important websites are 
likely to receive more links from other websites. The algorithm assigns 
a PageRank, or score, to each web page based on its linking web 
pages, with the links from different web pages being weighted 
according to particular criteria. The Google matrix represents the 
links between the web pages. A fundamental part of the algorithm is 
an iterative method that computes the dominant eigenvalue, that is, 
the eigenvalue of largest magnitude, and the corresponding 
eigenvector of the Google matrix to rank the web pages. 


If a characteristic equation has no real solutions, then we say that there
are no eigenvalues. For example, in Exercise C117(c), you considered the
linear transformation representing an anticlockwise rotation through $\pi/2$
about the origin. The matrix of this linear transformation is


$$\mathbf{A} = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}.$$


By the above definition, the characteristic equation of this linear
transformation is


$$\det(\mathbf{A} - \lambda \mathbf{I}) = \begin{vmatrix} 0 - \lambda & -1 \\ 1 & 0 - \lambda \end{vmatrix} = 0.$$
We expand the determinant and obtain
$$\lambda^2 + 1 = 0.$$


This equation has no real solutions: the linear transformation has no 
eigenvalues and hence no eigenvectors. This agrees with the geometric 
interpretation: no line through the origin is mapped to itself by this 
rotation. 


We summarise this matrix method for finding eigenvalues and eigenvectors 
in the following strategy. 




Strategy C18 


To determine the eigenvalues and eigenvectors of a square matrix $\mathbf{A}$,
do the following.


1. Find the eigenvalues:
   • write down the characteristic equation
     $\det(\mathbf{A} - \lambda \mathbf{I}) = 0$
   • expand this determinant to obtain a polynomial equation in $\lambda$
   • solve this equation to find the eigenvalues.
2. Find the eigenvectors:
   • write down the eigenvector equations
     $(\mathbf{A} - \lambda \mathbf{I})\mathbf{v} = \mathbf{0}$
   • for each eigenvalue $\lambda$, solve this system of linear equations to
     find the corresponding eigenvectors.
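
For a $2 \times 2$ matrix the two steps of the strategy can be sketched in Python as follows. This is an illustrative sketch, not a general-purpose routine: the function name `eigen_2x2` is an assumption, the quadratic formula stands in for "solve this equation", only one representative eigenvector is returned for each eigenvalue, and the exact comparisons with zero are reliable only for small integer examples like those in this unit.

```python
import math


def eigen_2x2(a, b, c, d):
    """Real eigenvalues of [[a, b], [c, d]], each with one eigenvector."""
    # Step 1: det(A - lam*I) = lam^2 - (a + d)*lam + (ad - bc) = 0.
    trace, det = a + d, a * d - b * c
    disc = trace * trace - 4 * det
    if disc < 0:
        return []  # no real solutions, so no eigenvalues
    root = math.sqrt(disc)
    eigenvalues = sorted({(trace - root) / 2, (trace + root) / 2})

    # Step 2: solve (A - lam*I)v = 0 for each eigenvalue in turn.
    results = []
    for lam in eigenvalues:
        if b != 0 or a - lam != 0:
            v = (b, lam - a)      # satisfies (a - lam)x + by = 0
        elif c != 0 or d - lam != 0:
            v = (lam - d, c)      # satisfies cx + (d - lam)y = 0
        else:
            v = (1.0, 0.0)        # A - lam*I is the zero matrix
        results.append((lam, v))
    return results


print(eigen_2x2(1, 4, 1, -2))   # [(-3.0, (4, -4.0)), (2.0, (4, 1.0))]
print(eigen_2x2(0, -1, 1, 0))   # []
```

Every non-zero multiple of a returned vector is also an eigenvector, so $(4, -4)$ represents the family $(k, -k)$ and $(4, 1)$ the family $(4k, k)$.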


We illustrate Strategy C18 with the following worked exercise and exercise. 


Worked Exercise C63 


Let $t : \mathbb{R}^2 \to \mathbb{R}^2$ be the linear transformation given by


$$t(x, y) = (5x + 2y, 2x + 5y).$$


Write down the matrix of $t$ with respect to the standard basis for $\mathbb{R}^2$, and
find the eigenvalues and eigenvectors of $t$.




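Expanding the characteristic equation $\det(\mathbf{A} - \lambda \mathbf{I}) = 0$ of the matrix
$$\mathbf{A} = \begin{pmatrix} 5 & 2 \\ 2 & 5 \end{pmatrix}$$
of $t$ gives $(5 - \lambda)^2 - 4 = 0$, which simplifies to
$$\lambda^2 - 10\lambda + 21 = (\lambda - 7)(\lambda - 3) = 0.$$
The eigenvalues of $\mathbf{A}$ are therefore $\lambda = 7$ and $\lambda = 3$.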


Next we find the eigenvectors of A. 


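The eigenvector equations are $(\mathbf{A} - \lambda \mathbf{I})\mathbf{v} = \mathbf{0}$; that is,


$$\begin{pmatrix} 5 - \lambda & 2 \\ 2 & 5 - \lambda \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 0 \\ 0 \end{pmatrix},$$


which we write as a system of linear equations.


The eigenvector equations are


$$\begin{aligned} (5 - \lambda)x + 2y &= 0 \\ 2x + (5 - \lambda)y &= 0. \end{aligned}$$


$\lambda = 7$: The eigenvector equations become


$$\begin{aligned} -2x + 2y &= 0 \\ 2x - 2y &= 0. \end{aligned}$$


These equations are equivalent to the single equation
$$y = x.$$


Thus the eigenvectors corresponding to $\lambda = 7$ are the non-zero
vectors for which $y = x$; that is, the vectors of the form


$$(k, k), \quad \text{where } k \neq 0.$$
$\lambda = 3$: The eigenvector equations become
$$\begin{aligned} 2x + 2y &= 0 \\ 2x + 2y &= 0. \end{aligned}$$
These equations are equivalent to the single equation
$$y = -x.$$


Thus the eigenvectors corresponding to $\lambda = 3$ are the non-zero
vectors for which $y = -x$; that is, the vectors of the form


$$(k, -k), \quad \text{where } k \neq 0.$$


Thus the eigenvectors of the linear transformation $t$ are the non-zero
vectors of the following forms:


$(k, k)$, corresponding to $\lambda = 7$,
$(k, -k)$, corresponding to $\lambda = 3$.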




Exercise C119 


For each of the following linear transformations $t : \mathbb{R}^2 \to \mathbb{R}^2$, write down
the matrix of $t$ with respect to the standard basis for $\mathbb{R}^2$, and find the
eigenvalues and eigenvectors of $t$.


(a) $t(x, y) = (x + 3y, 2x - 4y)$    (b) $t(x, y) = (x - 2y, -2x - 2y)$


So far we have concentrated on linear transformations from $\mathbb{R}^2$ to $\mathbb{R}^2$ and
on $2 \times 2$ matrices. We now use Strategy C18 to find the eigenvalues and
eigenvectors of a linear transformation from $\mathbb{R}^3$ to $\mathbb{R}^3$ using a $3 \times 3$ matrix.
Notice that here the characteristic equation is again a polynomial equation
in $\lambda$ whose degree is the dimension of the domain of $t$ (in this case 3).


Worked Exercise C64 


Let $t : \mathbb{R}^3 \to \mathbb{R}^3$ be the linear transformation given by
$$t(x, y, z) = (2x + z, -x + 2y + 3z, x + 2z).$$


Write down the matrix of $t$ with respect to the standard basis for $\mathbb{R}^3$, and
find the eigenvalues and eigenvectors of $t$.


Solution 


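Since we are using the standard basis, we can again simply ‘read
off’ the matrix: the columns are the images of $(1, 0, 0)$, $(0, 1, 0)$ and
$(0, 0, 1)$ under $t$.


The matrix of $t$ with respect to the standard basis for $\mathbb{R}^3$ is


$$\mathbf{A} = \begin{pmatrix} 2 & 0 & 1 \\ -1 & 2 & 3 \\ 1 & 0 & 2 \end{pmatrix}.$$


We use Strategy C18 to find the eigenvalues and eigenvectors of $\mathbf{A}$,
which are the same as those of $t$.


First we find the eigenvalues of $\mathbf{A}$.


Here we need the $3 \times 3$ identity matrix
$$\mathbf{I} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix},$$
and so subtract $\lambda$ from the three diagonal entries of $\mathbf{A}$.
The characteristic equation is $\det(\mathbf{A} - \lambda \mathbf{I}) = 0$; that is,
$$\begin{vmatrix} 2 - \lambda & 0 & 1 \\ -1 & 2 - \lambda & 3 \\ 1 & 0 & 2 - \lambda \end{vmatrix} = 0.$$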




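We expand the determinant and obtain


$$(2 - \lambda) \begin{vmatrix} 2 - \lambda & 3 \\ 0 & 2 - \lambda \end{vmatrix} + \begin{vmatrix} -1 & 2 - \lambda \\ 1 & 0 \end{vmatrix} = 0.$$


Simplifying this expression, we obtain
$$(2 - \lambda)((2 - \lambda)^2 - 0) + (0 - (2 - \lambda)) = 0.$$


When there is a common factor, it is best to keep this separate:
the problem then reduces to factorising the remaining quadratic
polynomial.


Taking out the common factor gives


$$(2 - \lambda)((2 - \lambda)^2 - 1) = 0,$$


which simplifies to
$$(2 - \lambda)(\lambda^2 - 4\lambda + 3) = 0.$$

We can factorise this characteristic equation as
$$(2 - \lambda)(\lambda - 3)(\lambda - 1) = 0.$$

The eigenvalues of $\mathbf{A}$ are therefore $\lambda = 3$, $\lambda = 2$ and $\lambda = 1$.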


Next we find the eigenvectors of A. 


The eigenvector equations are


$$\begin{aligned} (2 - \lambda)x + z &= 0 \\ -x + (2 - \lambda)y + 3z &= 0 \\ x + (2 - \lambda)z &= 0. \end{aligned}$$


$\lambda = 3$: The eigenvector equations become


$$\begin{aligned} -x + z &= 0 \\ -x - y + 3z &= 0 \\ x - z &= 0. \end{aligned}$$


It may sometimes be necessary to use the method of
Gauss-Jordan elimination from Unit C1, but here the solutions
can be found directly.


The first and third equations imply that
$$z = x.$$

Substituting this into the second equation yields the equation
$$2x - y = 0.$$


Thus the eigenvectors corresponding to $\lambda = 3$ are the non-zero
vectors $(x, y, z)$ satisfying $z = x$ and $y = 2x$; that is, the vectors
of the form


$$(k, 2k, k), \quad \text{where } k \neq 0.$$




$\lambda = 2$: The eigenvector equations become


$$\begin{aligned} z &= 0 \\ -x + 3z &= 0 \\ x &= 0. \end{aligned}$$


These equations have the solution
$$x = 0 \text{ and } z = 0.$$


However, there are no constraints on the unknown $y$. Thus the
eigenvectors corresponding to $\lambda = 2$ are the non-zero vectors
$(x, y, z)$ satisfying $x = 0$ and $z = 0$; that is, the vectors of the
form


$$(0, k, 0), \quad \text{where } k \neq 0.$$


$\lambda = 1$: The eigenvector equations become


$$\begin{aligned} x + z &= 0 \\ -x + y + 3z &= 0 \\ x + z &= 0. \end{aligned}$$


The first and third equations imply that
$$z = -x.$$

Substituting this into the second equation yields the equation
$$-4x + y = 0.$$


Thus the eigenvectors corresponding to $\lambda = 1$ are the non-zero
vectors $(x, y, z)$ satisfying $z = -x$ and $y = 4x$; that is, the
vectors of the form


$$(k, 4k, -k), \quad \text{where } k \neq 0.$$


Thus the eigenvectors of the linear transformation $t$ are the non-zero
vectors of the following forms:

$(k, 2k, k)$, corresponding to $\lambda = 3$,

$(0, k, 0)$, corresponding to $\lambda = 2$,

$(k, 4k, -k)$, corresponding to $\lambda = 1$.
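
Results like these are easy to check by multiplying out. The following minimal Python sketch takes the representative with $k = 1$ from each family and confirms that $\mathbf{A}\mathbf{v} = \lambda \mathbf{v}$; the names `mat_vec` and `claimed` are introduced here for illustration only.

```python
# Matrix of t(x, y, z) = (2x + z, -x + 2y + 3z, x + 2z).
A = [[2, 0, 1],
     [-1, 2, 3],
     [1, 0, 2]]


def mat_vec(M, v):
    """Multiply the matrix M (a list of rows) by the vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


# Eigenvalue -> representative eigenvector (taking k = 1 in each family).
claimed = {3: [1, 2, 1], 2: [0, 1, 0], 1: [1, 4, -1]}

for lam, v in claimed.items():
    # A v should equal lam times v, component by component.
    assert mat_vec(A, v) == [lam * x for x in v]
    print("lambda =", lam, ": A v =", mat_vec(A, v))
```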


Although cubic polynomials may not always be easy to factorise, you met 
some ways of factorising such polynomials in Subsection 1.4 of Unit A2 
Number systems. However, we will usually deal with examples that 
factorise easily. 




The following result, which we do not prove here, gives a useful check on 
the values found for the eigenvalues. You are asked to prove it yourself for 
$2 \times 2$ matrices in the additional exercises booklet for this unit.


Proposition C56 


The sum of the eigenvalues of a square matrix A is equal to the sum 
of the diagonal entries of A. 


For example, in Worked Exercise C64 the eigenvalues are 3, 2 and 1, which 
sum to 6, and the diagonal entries of the matrix A are 2, 2 and 2, which 
also sum to 6. 


The sum of the diagonal entries of a square matrix is sometimes referred to 
as the trace of the matrix. 
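
The check can be applied to each matrix whose eigenvalues have been found so far in this section. The sketch below does this in Python; the function name `trace` and the list `examples` are introduced here for illustration, with the eigenvalues copied from the worked examples above.

```python
def trace(M):
    """Sum of the diagonal entries of the square matrix M."""
    return sum(M[i][i] for i in range(len(M)))


# (matrix, eigenvalues found in the text) for three examples in this section.
examples = [
    ([[1, 4], [1, -2]], [2, -3]),
    ([[5, 2], [2, 5]], [7, 3]),
    ([[2, 0, 1], [-1, 2, 3], [1, 0, 2]], [3, 2, 1]),
]

for matrix, eigenvalues in examples:
    # The two sums should agree if the eigenvalues are correct.
    print(trace(matrix), sum(eigenvalues))
    assert trace(matrix) == sum(eigenvalues)
```

Agreement does not prove that the eigenvalues are correct, but a disagreement shows that a mistake has been made.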


Exercise C120 
Let $t : \mathbb{R}^3 \to \mathbb{R}^3$ be the linear transformation given by


$$t(x, y, z) = (4x + 2y, 2x + 3y + 2z, 2y + 2z).$$


Write down the matrix of $t$ with respect to the standard basis for $\mathbb{R}^3$, and
find the eigenvalues and eigenvectors of $t$.


In most of the examples we have seen so far, the eigenvalues have not been 
easy to recognise directly and Strategy C18 has been required to find 
them. This is not always the case, as the following exercise illustrates. 


Exercise C121 


Find the eigenvalues of each of the following matrices. 


T 8 0 0 4 00 
(a) G J (b) [0 -5 0 (c) {25 -2 0 
0 0 21 17 r 6 


Finding eigenvalues of triangular and diagonal matrices is straightforward, 
as Exercise C121 illustrates. The eigenvalues are the diagonal entries of 
the matrix and no calculation is needed to find them. 


Theorem C57 


The eigenvalues of a triangular matrix and of a diagonal matrix are 
the diagonal entries of the matrix. 




Proof A lower triangular matrix has every entry above the main
diagonal zero. A diagonal matrix and the transpose of an upper triangular
matrix are lower triangular matrices, so we can consider just lower
triangular matrices here.


Let $\mathbf{A} = (a_{ij})$ be an $n \times n$ lower triangular matrix, so $a_{ij} = 0$ for all $j > i$.
The eigenvalues of $\mathbf{A}$ are the solutions to the characteristic equation
$\det(\mathbf{A} - \lambda \mathbf{I}) = 0$. Now $\mathbf{A} - \lambda \mathbf{I}$ has diagonal entries $a_{ii} - \lambda$, and every entry
above the main diagonal is zero.


We expand the determinant along the top row and continue by
expanding along the top row of the resulting determinants until the only
determinants in the expression are of size $2 \times 2$.


The first term in the full expansion of the determinant is the only non-zero
term in the expansion because of the placement of the zeros in the smaller
determinants. This non-zero term is $(a_{11} - \lambda)(a_{22} - \lambda) \cdots (a_{nn} - \lambda)$.
Therefore the solutions to the characteristic equation $\det(\mathbf{A} - \lambda \mathbf{I}) = 0$ are
$a_{11}, a_{22}, \ldots, a_{nn}$, by the Factor Theorem (Theorem A2 in Unit A2), and
the eigenvalues of $\mathbf{A}$ are precisely the diagonal entries of the matrix.


A diagonal matrix is lower triangular, and an upper triangular matrix has
the same characteristic equation as its lower triangular transpose because
$\det \mathbf{A}^T = \det \mathbf{A}$, so the
eigenvalues of a triangular or diagonal matrix are the diagonal entries. ∎
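
The expansion used in the proof can be mimicked in code. The sketch below computes determinants by repeated expansion along the top row and confirms, for one lower triangular matrix, that $\det(\mathbf{A} - \lambda \mathbf{I})$ equals the product of the factors $a_{ii} - \lambda$. The matrix `L` is a hypothetical example chosen here, not one taken from the unit, and the function names are illustrative.

```python
def det(M):
    """Determinant by cofactor expansion along the top row."""
    if len(M) == 1:
        return M[0][0]
    total = 0
    for j, entry in enumerate(M[0]):
        # Delete the top row and column j to form the smaller determinant.
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def char_det(M, lam):
    """det(M - lam*I): subtract lam from each diagonal entry first."""
    n = len(M)
    shifted = [[M[i][j] - (lam if i == j else 0) for j in range(n)]
               for i in range(n)]
    return det(shifted)


# A hypothetical lower triangular matrix with diagonal entries 3, -1, 4.
L = [[3, 0, 0],
     [5, -1, 0],
     [7, 2, 4]]

for lam in range(-5, 6):
    # Only the product of the diagonal factors survives the expansion.
    assert char_det(L, lam) == (3 - lam) * (-1 - lam) * (4 - lam)

print([lam for lam in range(-5, 6) if char_det(L, lam) == 0])  # [-1, 3, 4]
```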


1.3 Eigenspaces 


In Subsection 1.1 we considered the linear transformation $t : \mathbb{R}^2 \to \mathbb{R}^2$
given by


$$t(x, y) = (x + 4y, x - 2y),$$


and saw that each of the lines $y = -x$ and $x = 4y$ is mapped to itself.


The line $y = -x$, shown in Figure 3, consists of the points of the form
$(k, -k)$, each of which is an eigenvector of $t$ corresponding to the
eigenvalue $\lambda = -3$, except when $k = 0$, which is specifically excluded.
Similarly, the line $x = 4y$, also shown in Figure 3, consists of the points of
the form $(4k, k)$, each of which is an eigenvector corresponding to the
eigenvalue $\lambda = 2$, except when $k = 0$.


Figure 3 The lines comprising the eigenvectors of $t$


For each eigenvalue $\lambda$, if we look at all the solutions to the equation
$t(\mathbf{v}) = \lambda \mathbf{v}$ (including $\mathbf{v} = \mathbf{0}$), then we obtain a line through the origin. The
set of such solutions is a subspace of the domain of $t$.


Theorem C58


Let $t : V \to V$ be a linear transformation. For each eigenvalue $\lambda$ of $t$,
let $S(\lambda)$ be the set of vectors satisfying $t(\mathbf{v}) = \lambda \mathbf{v}$; that is, $S(\lambda)$ is the
set of eigenvectors corresponding to $\lambda$, together with the zero
vector $\mathbf{0}$. Then $S(\lambda)$ is a subspace of $V$.




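Proof Consider any eigenvalue $\lambda$ of a linear transformation $t : V \to V$.


We use Strategy C10 from Unit C2 Vector spaces and first check that
$\mathbf{0} \in S(\lambda)$.


For any linear transformation $t$, we have $t(\mathbf{0}) = \mathbf{0} = \lambda \mathbf{0}$, so $\mathbf{0} \in S(\lambda)$.
Next we check that if $\mathbf{v}_1, \mathbf{v}_2 \in S(\lambda)$, then $\mathbf{v}_1 + \mathbf{v}_2 \in S(\lambda)$.
Let $\mathbf{v}_1, \mathbf{v}_2 \in S(\lambda)$. Then
$$t(\mathbf{v}_1 + \mathbf{v}_2) = t(\mathbf{v}_1) + t(\mathbf{v}_2) = \lambda \mathbf{v}_1 + \lambda \mathbf{v}_2 = \lambda(\mathbf{v}_1 + \mathbf{v}_2),$$
since $t$ is a linear transformation.
Hence $\mathbf{v}_1 + \mathbf{v}_2 \in S(\lambda)$.
Finally, we check that if $\mathbf{v} \in S(\lambda)$ and $\alpha \in \mathbb{R}$, then $\alpha \mathbf{v} \in S(\lambda)$.
Let $\mathbf{v} \in S(\lambda)$ and $\alpha \in \mathbb{R}$. Then
$$t(\alpha \mathbf{v}) = \alpha t(\mathbf{v}) = \alpha \lambda \mathbf{v} = \lambda(\alpha \mathbf{v}),$$
since $t$ is a linear transformation.
Hence $\alpha \mathbf{v} \in S(\lambda)$.
Thus $S(\lambda)$ is a subspace of $V$. ∎


Since $S(\lambda)$ is a subspace comprising eigenvectors (and $\mathbf{0}$), we call it an
eigenspace.


Definition
Let $t : V \to V$ be a linear transformation and, for each eigenvalue $\lambda$
of $t$, let $S(\lambda)$ be the set of vectors satisfying $t(\mathbf{v}) = \lambda \mathbf{v}$. Then $S(\lambda)$ is
the eigenspace of $t$ corresponding to the eigenvalue $\lambda$.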


Worked Exercise C65 


Let $t : \mathbb{R}^3 \to \mathbb{R}^3$ be the linear transformation given by
$$t(x, y, z) = (4x + 2y, 2x + 3y + 2z, 2y + 2z).$$


Find the eigenspace $S(0)$ of $t$, specify a basis for it and state its dimension.


(You found the eigenvalues and eigenvectors of this linear transformation 
in Exercise C120.) 




Any vector in $S(0)$ can be written as $k(1, -2, 2)$, so


$$\{(1, -2, 2)\}$$
is a basis for $S(0)$. Thus $S(0)$ has dimension 1.


Geometrically, $S(0)$ is a line through the origin in the direction of
the vector $(1, -2, 2)$, so the only eigenvectors of $t$ corresponding to
$\lambda = 0$ are on this line.


Exercise C122 


Let $t : \mathbb{R}^3 \to \mathbb{R}^3$ be the linear transformation given by
$$t(x, y, z) = (4x + 2y, 2x + 3y + 2z, 2y + 2z).$$


Find the eigenspaces $S(6)$ and $S(3)$ of $t$. In each case, specify a basis and
state the dimension of the eigenspace.


(In Exercise C120 you found that the eigenvectors of $t$ are the non-zero
vectors $(2k, 2k, k)$ and $(-2k, k, 2k)$, corresponding to the eigenvalues $\lambda = 6$
and $\lambda = 3$, respectively.)


Worked Exercise C66 


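Let $t : \mathbb{R}^3 \to \mathbb{R}^3$ be the linear transformation given by


$$t(x, y, z) = (0, y, z).$$


Find all the eigenspaces of $t$. In each case, specify a basis and state the
dimension of the eigenspace.


Solution


The matrix of $t$ with respect to the standard basis for $\mathbb{R}^3$ is


$$\mathbf{A} = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}.$$


This matrix is diagonal, so the eigenvalues are the diagonal entries:
$\lambda = 0$, $\lambda = 1$ and $\lambda = 1$.


The eigenvector equations are
$$\begin{aligned} -\lambda x &= 0 \\ (1 - \lambda)y &= 0 \\ (1 - \lambda)z &= 0. \end{aligned}$$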




$\lambda = 0$: The eigenvector equations become


$$0x = 0, \quad y = 0 \quad \text{and} \quad z = 0.$$


Thus the eigenvectors corresponding to the eigenvalue $\lambda = 0$ are
the non-zero vectors $(x, y, z)$ satisfying $y = 0$ and $z = 0$; that is,
the vectors of the form


$$(k, 0, 0), \quad \text{where } k \neq 0.$$

The eigenspace $S(0)$ is the set of vectors
$$\{(k, 0, 0) : k \in \mathbb{R}\}.$$

Any vector in $S(0)$ can be written as $k(1, 0, 0)$, so
$$\{(1, 0, 0)\}$$

is a basis for $S(0)$. Thus $S(0)$ has dimension 1.

Geometrically, $S(0)$ is the $x$-axis in $\mathbb{R}^3$.


$\lambda = 1$: The eigenvector equations reduce to the single equation


$$-x = 0.$$


Thus the eigenvectors corresponding to the eigenvalue $\lambda = 1$ are
the non-zero vectors $(x, y, z)$ satisfying $x = 0$; that is, the
vectors of the form


$$(0, k, l), \quad \text{where } k \text{ and } l \text{ are not both } 0.$$

The eigenspace $S(1)$ is the set of vectors
$$\{(0, k, l) : k, l \in \mathbb{R}\}.$$

Any vector in $S(1)$ can be written as $k(0, 1, 0) + l(0, 0, 1)$, so
$$\{(0, 1, 0), (0, 0, 1)\}$$

is a basis for $S(1)$. Thus $S(1)$ has dimension 2.

Geometrically, $S(1)$ is the plane $x = 0$ through the origin.


In Worked Exercise C66 the (simplified) characteristic equation of the
linear transformation $t$ is


$$\lambda(\lambda - 1)^2 = 0.$$


The eigenvalue $\lambda = 1$ is a ‘repeated’ solution of this characteristic
equation; it is a multiple root and we say that $\lambda = 1$ has multiplicity 2
because the factor $(\lambda - 1)$ occurs twice.


In general, we adopt the following definition. 




Definition 

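If the characteristic equation of a square matrix $\mathbf{A}$ can be written as
$$(\lambda - \lambda_1)^{m_1} (\lambda - \lambda_2)^{m_2} \cdots (\lambda - \lambda_p)^{m_p} = 0,$$
where $\lambda_1, \lambda_2, \ldots, \lambda_p$ are distinct, then the eigenvalue $\lambda_j$ of $\mathbf{A}$ has
multiplicity $m_j$, for $j = 1, 2, \ldots, p$.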


For a triangular or diagonal matrix, the multiplicity of an eigenvalue is the 
number of times it appears on the main diagonal. 
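
For a triangular or diagonal matrix this count is simple to automate. The sketch below uses `collections.Counter` from the Python standard library; the function name `diagonal_multiplicities` is an assumption made here, and the matrix is the diagonal matrix of Worked Exercise C66.

```python
from collections import Counter


def diagonal_multiplicities(M):
    """Map each diagonal entry of M to the number of times it occurs.

    For a triangular or diagonal matrix this gives each eigenvalue
    together with its multiplicity; it is not valid for other matrices.
    """
    return dict(Counter(M[i][i] for i in range(len(M))))


# The diagonal matrix of t(x, y, z) = (0, y, z).
A = [[0, 0, 0],
     [0, 1, 0],
     [0, 0, 1]]

print(diagonal_multiplicities(A))  # {0: 1, 1: 2}
```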


Exercise C123 


Find the eigenvalues and eigenvectors of the matrix 


$$\begin{pmatrix} 1 & 1 & -1 \\ 0 & 4 & 0 \\ 0 & 0 & 4 \end{pmatrix}.$$


For each eigenvalue $\lambda$, state its multiplicity, find the corresponding
eigenspace $S(\lambda)$, specify a basis for $S(\lambda)$ and state its dimension.


From the examples that you have seen so far, you may be tempted to
conjecture that the dimension of the eigenspace $S(\lambda)$, for a given
eigenvalue $\lambda$, is equal to the multiplicity of $\lambda$. The following exercises give
you the chance to investigate this conjecture.


Exercise C124 


Find the eigenvalues and eigenvectors of the matrix 


\[ \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}. \]


For each eigenvalue λ, state its multiplicity, find the corresponding 
eigenspace S(λ), specify a basis for S(λ) and state its dimension. 


Exercise C125 


Find the eigenvalues and eigenvectors of the matrix 


\[ \begin{pmatrix} 1 & -1 & 0 \\ 1 & 4 & 1 \\ -1 & 1 & 4 \end{pmatrix}. \]


For each eigenvalue λ, state its multiplicity, find the corresponding 
eigenspace S(λ), specify a basis for S(λ) and state its dimension. 


Hint: Look for factors in the characteristic equation and remember that 
x² − 1 = (x − 1)(x + 1). 




In Exercise C124 the eigenvalue λ = 1 has multiplicity 2, but it gives rise 
to an eigenspace of dimension only 1. In this case, the matrix represents a 
shear in the x-direction by a factor 1, as shown in Figure 4, and the only 
line through the origin left unchanged is the x-axis. Thus there is a single 
one-dimensional eigenspace, so the conjecture that the dimension of the 
eigenspace S(λ) is equal to the multiplicity of λ is false. 


Figure 4 A shear in the x-direction by a factor 1 


In Exercise C125 both eigenspaces have dimension 1 despite the 
eigenvalue 2 having multiplicity 2 and the eigenvalue 5 having 
multiplicity 1. In general, it can be shown that the dimension of an 
eigenspace cannot exceed the multiplicity of the corresponding eigenvalue, 
but we will not prove this. 
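
The dimensions quoted above can be checked mechanically. The following is a minimal Python sketch, not a method given in the text: it assumes the standard fact that the solution space of (A − λI)v = 0 has dimension n − rank(A − λI), and the helper names `rank` and `eigenspace_dimension` are illustrative. The matrices are written out in the code: the shear matrix with rows (1, 1), (0, 1) and the 3 × 3 matrix with rows (1, −1, 0), (1, 4, 1), (−1, 1, 4).

```python
from fractions import Fraction

def rank(matrix):
    """Rank of a matrix, found by Gaussian elimination in exact arithmetic."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    n_rows, n_cols = len(rows), len(rows[0])
    r = 0  # index of the next pivot row
    for c in range(n_cols):
        pivot = next((i for i in range(r, n_rows) if rows[i][c] != 0), None)
        if pivot is None:
            continue  # no pivot in this column
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, n_rows):
            factor = rows[i][c] / rows[r][c]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
        if r == n_rows:
            break
    return r

def eigenspace_dimension(matrix, eigenvalue):
    """dim S(lambda) = n - rank(A - lambda I) (assumed standard fact)."""
    n = len(matrix)
    shifted = [[matrix[i][j] - (eigenvalue if i == j else 0) for j in range(n)]
               for i in range(n)]
    return n - rank(shifted)

shear = [[1, 1], [0, 1]]                     # eigenvalue 1, multiplicity 2
B = [[1, -1, 0], [1, 4, 1], [-1, 1, 4]]      # eigenvalues 2 (twice) and 5

print(eigenspace_dimension(shear, 1))                             # 1
print(eigenspace_dimension(B, 2), eigenspace_dimension(B, 5))     # 1 1
```

In each case the dimension is at most the multiplicity, and for the repeated eigenvalues it is strictly smaller.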


2 Diagonalising matrices 


In this section you will use the methods of finding eigenvalues and their 
corresponding eigenvectors that you met in the previous section to address 
the question posed in the introduction: 


Is it possible to find a basis for both the domain and codomain so 
that the matrix of a linear transformation is a diagonal matrix? 


It is therefore important that you are confident with the material in 
Section 1 before starting to study this section. 


2.1 Eigenvector bases 


In Section 1 we introduced the notions of an eigenvalue λ and 
corresponding eigenvector v of a linear transformation t : ℝⁿ → ℝⁿ; 
that is, a non-zero vector v whose image t(v) is λv. For example, in 
Exercise C119(a) you saw that the linear transformation t : ℝ² → ℝ² 
given by 

t(x, y) = (x + 3y, 2x − 4y) 

has eigenvalues λ = −5 and λ = 2 with corresponding eigenvectors the 
non-zero vectors of the forms (k, −2k) and (3k, k), respectively. We can 
choose any non-zero value of k to specify particular eigenvectors; here, putting 
k = 1 in both gives (1, −2) and (3, 1). Since (3, 1) is not a multiple of 
(1, −2), these two eigenvectors are linearly independent (this is the case 
whatever non-zero values of k are chosen). Therefore, by Theorem C25 in Unit C2, 
these linearly independent eigenvectors form a basis for ℝ², the domain 
and codomain of t. We say that {(1, −2), (3, 1)} is an eigenvector basis of t. 
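
As a quick check of this example, here is a small Python sketch. It takes t(x, y) = (x + 3y, 2x − 4y), the formula given for this transformation in Subsection 2.2, and uses the fact that two vectors in ℝ² are linearly independent exactly when the 2 × 2 determinant of their coordinates is non-zero. The function names are illustrative.

```python
def t(v):
    """The linear transformation t(x, y) = (x + 3y, 2x - 4y)."""
    x, y = v
    return (x + 3 * y, 2 * x - 4 * y)

def is_eigenvector(v, eigenvalue):
    """True if v is non-zero and t(v) equals eigenvalue times v."""
    return any(v) and t(v) == tuple(eigenvalue * c for c in v)

e1, e2 = (1, -2), (3, 1)

# Determinant of the matrix with columns e1 and e2; non-zero means
# that e1 and e2 are linearly independent, so they form a basis for R^2.
det = e1[0] * e2[1] - e1[1] * e2[0]

print(is_eigenvector(e1, -5), is_eigenvector(e2, 2), det)  # True True 7
```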




Definition 
Let t : ℝⁿ → ℝⁿ be a linear transformation and let E be a basis 
for ℝⁿ consisting of eigenvectors of t. The basis E is an eigenvector 
basis of t. 


Exercise C126 


Verify that {(−2, 1), (1, 2)} is an eigenvector basis of the linear 
transformation t : ℝ² → ℝ² given by 


t(x, y) = (x − 2y, −2x − 2y). 


(In Exercise C119(b) you found that the eigenvectors of t are the non-zero 
vectors (−2k, k) and (k, 2k), corresponding to the eigenvalues λ = 2 and 
λ = −3, respectively.) 


Exercise C127 


The set E = {(0, 1, −1), (−2, 1, 0), (1, 0, −1)} is a basis for ℝ³. Verify that 
E is an eigenvector basis of the linear transformation t : ℝ³ → ℝ³ given by 


t(x, y, z) = (−x + 2y + 2z, 2x + 2y + 2z, −3x − 6y − 6z). 


In Unit C3 you met Strategy C15 for finding the matrix representation of a 
linear transformation t : V → W with respect to given bases E and F for 
the domain and codomain of t. In this subsection you will see that this 
matrix representation is particularly simple if W = V, E is an eigenvector 
basis of t and F = E. 


Recall that if E = {e₁, e₂, ..., eₙ} is a basis for V, and v is a vector in V 
such that v = v₁e₁ + ⋯ + vₙeₙ, then the numbers v₁, ..., vₙ are the 
E-coordinates of v, and v_E = (v₁, ..., vₙ)_E is the E-coordinate 
representation of v. If E is the standard basis for V, then we usually omit 
the suffix E. 


We begin by rewriting Strategy C15 for the particular case when W = V 
and F = E (not necessarily an eigenvector basis). 


Strategy C19 (Strategy C15 with W = V and F = E) 


To find the matrix A of a linear transformation t : V → V with 
respect to the basis E = {e₁, e₂, ..., eₙ}, do the following. 


1. Find t(e₁), t(e₂), ..., t(eₙ). 

2. Find the E-coordinates of each of these image vectors. 

3. Construct the matrix A column by column using the E-coordinates 
   of t(eⱼ) to form column j, for j = 1, 2, ..., n. 
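
The three steps of Strategy C19 can be sketched in Python as follows. This is an illustration under stated assumptions rather than part of the strategy itself: V is taken to be ℝⁿ, the E-coordinates in step 2 are found by solving a linear system with Gauss-Jordan elimination, and the helper names `solve` and `matrix_wrt_basis` are hypothetical. The example transformation is t(x, y) = (x + 3y, 2x − 4y).

```python
from fractions import Fraction

def solve(A, b):
    """Solve A x = b for a square invertible A by Gauss-Jordan elimination."""
    n = len(A)
    M = [[Fraction(x) for x in row] + [Fraction(rhs)] for row, rhs in zip(A, b)]
    for c in range(n):
        pivot = next(i for i in range(c, n) if M[i][c] != 0)
        M[c], M[pivot] = M[pivot], M[c]
        M[c] = [x / M[c][c] for x in M[c]]          # make the pivot equal to 1
        for i in range(n):
            if i != c:
                M[i] = [a - M[i][c] * p for a, p in zip(M[i], M[c])]
    return [row[n] for row in M]

def matrix_wrt_basis(t, basis):
    """Matrix of t : R^n -> R^n with respect to the basis E = `basis`."""
    n = len(basis)
    # Matrix whose columns are the basis vectors in standard coordinates.
    E_cols = [[basis[j][i] for j in range(n)] for i in range(n)]
    # Steps 1 and 2: find t(e_j), then its E-coordinates.
    coords = [solve(E_cols, t(e)) for e in basis]
    # Step 3: the E-coordinates of t(e_j) form column j.
    return [[coords[j][i] for j in range(n)] for i in range(n)]

def t(v):
    x, y = v
    return (x + 3 * y, 2 * x - 4 * y)

A_E = matrix_wrt_basis(t, [(1, -2), (3, 1)])   # an eigenvector basis of t
A_F = matrix_wrt_basis(t, [(1, 0), (1, 1)])    # a basis that is not
print([[int(x) for x in row] for row in A_E])  # [[-5, 0], [0, 2]]
print([[int(x) for x in row] for row in A_F])  # [[-1, 6], [2, -2]]
```

With the eigenvector basis the resulting matrix is diagonal; with the other basis it is not, although both matrices represent the same transformation.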




In the next worked exercise we illustrate what happens when we find the 
matrix of a linear transformation t with respect to an eigenvector basis of t. 


Worked Exercise C67 


Consider the linear transformation t : ℝ² → ℝ² given by 

t(x, y) = (x + 3y, 2x − 4y). 


(a) Write down the matrix of t with respect to the standard basis for ℝ². 


(b) Find the matrix of t with respect to the eigenvector basis 


E = {(1, −2), (3, 1)}. 


Solution 


(a) The matrix of t with respect to the standard basis for ℝ² is 

\[ \begin{pmatrix} 1 & 3 \\ 2 & -4 \end{pmatrix}. \]


(b) Following Strategy C19, first we find the images of the vectors in 
the basis E = {(1, −2), (3, 1)}: 


t(1, −2) = (−5, 10) and t(3, 1) = (6, 2). 


We now write these image vectors in terms of their 
coordinates with respect to the eigenvector basis; that is, we 
express each of these vectors as a linear combination of the basis 
vectors in E = {(1, −2), (3, 1)}. The resulting calculations are 
remarkably straightforward! 


Next we find the E-coordinates of each of these image vectors: 
(−5, 10) = −5(1, −2) + 0(3, 1) 
         = (−5, 0)_E, 
(6, 2) = 0(1, −2) + 2(3, 1) 
       = (0, 2)_E. 


Therefore t(1, −2) = (−5, 0)_E and t(3, 1) = (0, 2)_E. So the matrix 
of t with respect to the eigenvector basis E is 

\[ \begin{pmatrix} -5 & 0 \\ 0 & 2 \end{pmatrix}. \]


In Worked Exercise C67(b) we found that the matrix of t with respect to 
the eigenvector basis is diagonal and that its diagonal entries are the 
eigenvalues of the linear transformation t. This is because the matrix of 
the linear transformation t maps the basis vectors to their images under t, 
but these basis vectors are precisely the eigenvectors that get mapped to 
multiples of themselves. You should find a similar outcome in the next 
exercise. 


Exercise C128 


Consider the linear transformation t : ℝ² → ℝ² given by 


t(x, y) = (x − 2y, −2x − 2y). 


(a) Write down the matrix of t with respect to the standard basis for ℝ². 


(b) Find the matrix of t with respect to the eigenvector basis 


E = {(−2, 1), (1, 2)}, 


which you found in Exercise C126. 


Worked Exercise C67(b) and Exercise C128(b) are special cases of the 
following result. We use the letter D in this result because the matrix is 
diagonal. 


Theorem C59 


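Let t : ℝⁿ → ℝⁿ be a linear transformation, let E = {e₁, e₂, ..., eₙ} 
be an eigenvector basis of t and let t(eⱼ) = λⱼeⱼ, for j = 1, 2, ..., n. 
Then the matrix of t with respect to the eigenvector basis E is 

\[ D = \begin{pmatrix} \lambda_1 & 0 & \cdots & 0 \\ 0 & \lambda_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \lambda_n \end{pmatrix}. \]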


Proof Let t and E be as in the statement of the theorem. We use 
Strategy C19 to find the matrix of t with respect to the eigenvector 
basis E. 


Eigenvector eⱼ corresponds to eigenvalue λⱼ. 

We have 
t(eⱼ) = λⱼeⱼ, for j = 1, 2, ..., n. 
We find the E-coordinates of each of these image vectors: 


t(e₁) = λ₁e₁ + 0e₂ + ⋯ + 0eₙ = (λ₁, 0, ..., 0)_E, 
t(e₂) = 0e₁ + λ₂e₂ + ⋯ + 0eₙ = (0, λ₂, ..., 0)_E, 
  ⋮ 
t(eₙ) = 0e₁ + 0e₂ + ⋯ + λₙeₙ = (0, 0, ..., λₙ)_E. 


So the matrix of t with respect to the eigenvector basis E is 

\[ D = \begin{pmatrix} \lambda_1 & 0 & \cdots & 0 \\ 0 & \lambda_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \lambda_n \end{pmatrix}, \]

as claimed. 


Using this result we can easily write down the matrix of a linear 
transformation with respect to an eigenvector basis. 


Exercise C129 


Consider the linear transformation t : ℝ³ → ℝ³ given by 
t(x, y, z) = (−x + 2y + 2z, 2x + 2y + 2z, −3x − 6y − 6z), 
with eigenvector basis 
E = {(0, 1, −1), (−2, 1, 0), (1, 0, −1)}. 


Use the solution to Exercise C127 to write down the matrix of t with 
respect to this eigenvector basis. 


2.2 Transition matrices 


Suppose that t : ℝⁿ → ℝⁿ is a linear transformation and E is an 
eigenvector basis of t. We have just shown that the matrix of t with 
respect to the eigenvector basis E is a diagonal matrix D. 


Figures 5 and 6 show the linear transformation t with respect to the 
eigenvector basis E and the standard basis, respectively. 


t : v_E ↦ Dv_E 


Figure 5 The linear transformation t with eigenvector basis E for the 
domain and codomain 


t : v ↦ Av 


Figure 6 The linear transformation t with standard basis V for the domain 
and codomain 


It is natural to ask whether there is any relationship between this matrix D 
and the matrix A of t with respect to the standard basis for ℝⁿ. It turns 
out that there is an algebraic relationship between the matrices D and A. 


We now show this relationship. To do this, first we find an algebraic 
relationship between the E-coordinate representation v_E of a vector (as in 
Figure 5) and the standard coordinate representation of the same vector 
(as in Figure 6). We begin by doing this for the example that we 
considered at the beginning of the section, where t : ℝ² → ℝ² is the linear 
transformation given by 


t(x, y) = (x + 3y, 2x − 4y) 

and E is the eigenvector basis {(1, −2), (3, 1)}. 

Suppose that the E-coordinate representation of a vector v in ℝ² is 
v_E = (a, b)_E. 

What are the standard coordinates of v? 


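In column form, 

\[ v = a\begin{pmatrix} 1 \\ -2 \end{pmatrix} + b\begin{pmatrix} 3 \\ 1 \end{pmatrix} = \begin{pmatrix} a + 3b \\ -2a + b \end{pmatrix} = \begin{pmatrix} 1 & 3 \\ -2 & 1 \end{pmatrix}\begin{pmatrix} a \\ b \end{pmatrix}. \]

Thus in matrix form we have 


v = Pv_E, 


where 

\[ P = \begin{pmatrix} 1 & 3 \\ -2 & 1 \end{pmatrix}. \]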


Now, by the Summary Theorem (Theorem C19 in Unit C1), a square 
matrix is invertible if and only if its determinant is non-zero. Here we have 
det P = 1 − (−6) = 7 ≠ 0, so P is invertible with inverse P⁻¹. 


Since v = Pv_E, it follows that 
P⁻¹v = P⁻¹(Pv_E) = (P⁻¹P)v_E = v_E. 


So multiplication on the left by the matrix P converts the E-coordinate 
representation of a vector into the standard coordinate representation and, 
similarly, multiplication on the left by the matrix P⁻¹ converts the 
standard coordinate representation of a vector into the E-coordinate 
representation. 
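
The two conversions can be tried out numerically. The sketch below is illustrative only: the E-coordinates (2, 1) are a hypothetical choice, and `inverse_2x2` and `apply` are helper names introduced here. It uses the matrix with columns (1, −2) and (3, 1) and the usual formula for the inverse of a 2 × 2 matrix.

```python
from fractions import Fraction

# Columns are the basis vectors (1, -2) and (3, 1) of E.
P = [[1, 3],
     [-2, 1]]

def inverse_2x2(M):
    """Inverse of a 2 x 2 matrix with non-zero determinant."""
    (a, b), (c, d) = M
    det = a * d - b * c
    return [[Fraction(d, det), Fraction(-b, det)],
            [Fraction(-c, det), Fraction(a, det)]]

def apply(M, v):
    """Matrix-vector product M v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

v_E = [2, 1]                      # hypothetical E-coordinates (a, b) = (2, 1)
v = apply(P, v_E)                 # standard coordinates: 2(1, -2) + 1(3, 1)
back = apply(inverse_2x2(P), v)   # converting back should recover v_E

print(v)                          # [5, -3]
print([int(c) for c in back])     # [2, 1]
```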




In this case the columns of P are formed from the standard coordinates of 
the vectors in E, and this is no coincidence. This simple relationship 
between the matrix P and the basis E always holds and we call P the 
transition matrix from the basis E to the standard basis for ℝ². 


The general definition is as follows. 


Definition 
Let E = {e₁, e₂, ..., eₙ} be a basis for ℝⁿ. The transition matrix P 
from the basis E to the standard basis for ℝⁿ is the matrix whose jth 
column is formed from the standard coordinates of eⱼ. 


Exercise C130 


(a) Write down the transition matrix P from the basis E = {(1, 3), (2, 5)} 
to the standard basis for ℝ². 


(b) Write down the transition matrix P from the basis E = {(0, 1, −1), 
(−2, 1, 0), (1, 0, −1)} to the standard basis for ℝ³. 


In the example above, we have seen that the transition matrix P from the 
basis E = {(1, −2), (3, 1)} to the standard basis for ℝ² converts 
E-coordinate representations into standard coordinate representations, and 
that P⁻¹ converts standard coordinate representations into E-coordinate 
representations. This is true in general. 


Theorem C60 


Let E = {e₁, e₂, ..., eₙ} be a basis for ℝⁿ and let P be the transition 
matrix from the basis E to the standard basis for ℝⁿ. Then the 
standard coordinate representation of a vector v in ℝⁿ is given by 


v = Pv_E. 

Moreover, P is invertible and 


v_E = P⁻¹v. 


Proof The matrix P converts the E-coordinate representation of a 
vector in ℝⁿ to the standard coordinate representation of the same vector 
in ℝⁿ, so in effect it is the matrix of the identity linear transformation 
i : ℝⁿ → ℝⁿ with respect to the basis E in the domain and the standard 
basis in the codomain. 


The statement v = Pv_E is equivalent to the statement that P is the 
matrix of the identity transformation i of ℝⁿ with respect to the basis E 
for the domain and the standard basis for the codomain. 


To find this matrix P, we use Strategy C15 from Unit C3. We begin by 
finding the images under i of the vectors in the domain basis E: 


i(e₁) = e₁, i(e₂) = e₂, ..., i(eₙ) = eₙ. 


It now follows from Strategy C15 that each column of P is formed from 
the standard coordinates of the corresponding basis vector, so P is the 
transition matrix from the basis E to the standard basis for ℝⁿ, as claimed. 


We know that the identity transformation i is invertible and that i⁻¹ = i. 
It follows from the Inverse Rule (Theorem C45 in Unit C3) that P is 
invertible and that P⁻¹ is the matrix of i : ℝⁿ → ℝⁿ with respect to the 
standard basis for the domain and the basis E for the codomain; that is, 


v ↦ v_E = P⁻¹v. ∎ 


When E is the standard basis for ℝⁿ, the matrix P is the identity 
matrix Iₙ, as you would expect. 


We also get the following corollary from Theorem C60. 


Corollary C61 


The rows or columns of an n × n matrix A form a set of n linearly 
independent vectors if and only if det A ≠ 0. 


Proof Let A be an n × n matrix. 

We start by proving the only if part. 


We first show that if the columns of A are linearly independent, then 
det A ≠ 0. 


Suppose the columns are linearly independent. Then the columns form a 
basis for ℝⁿ and A is the transition matrix from this basis to the standard 
basis. Hence A is invertible by Theorem C60, and so det A ≠ 0 by the 
Summary Theorem (Theorem C19 in Unit C1). 


If the rows of A are linearly independent then we consider the 
transpose Aᵀ. 


Suppose the rows of A are linearly independent. Then the columns of Aᵀ 
are linearly independent and det Aᵀ ≠ 0 by the above reasoning. We have 
det A = det Aᵀ by Theorem C14 in Unit C1, and hence det A ≠ 0, as 
required. 


We now prove the if part using the contrapositive; that is, we show 
that if the rows or columns of A are not linearly independent then 
det A = 0. 


Suppose the rows of A form a linearly dependent set. Then the row-reduced 
form of A contains a zero row, so A is not invertible by the Invertibility 
Theorem (Theorem C7 in Unit C1), and hence det A = 0 by the Summary 
Theorem. 




Figure 8 The transition in three steps 


Suppose the columns of A form a linearly dependent set. Then the rows of 
Aᵀ are linearly dependent and det A = det Aᵀ = 0 by the above reasoning. 


Hence, if det A ≠ 0, then the rows or columns of A form a linearly 
independent set of vectors. ∎ 
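
Corollary C61 gives a practical test for linear independence, sketched below in Python. The determinant is computed by cofactor expansion along the first row, which is adequate for small matrices; the function names are illustrative. The matrix `P` has columns (0, 1, −1), (−2, 1, 0) and (1, 0, −1), and `Q` is a hypothetical matrix whose third row is the sum of the first two.

```python
def det(M):
    """Determinant by cofactor expansion along the first row (small matrices)."""
    if len(M) == 1:
        return M[0][0]
    total = 0
    for j, entry in enumerate(M[0]):
        # Minor: delete the first row and column j.
        minor = [row[:j] + row[j + 1:] for row in M[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def independent(M):
    """Rows (and columns) of a square matrix are independent iff det M != 0."""
    return det(M) != 0

P = [[0, -2, 1],
     [1, 1, 0],
     [-1, 0, -1]]   # columns (0, 1, -1), (-2, 1, 0), (1, 0, -1)

Q = [[1, 2, 3],
     [4, 5, 6],
     [5, 7, 9]]     # hypothetical: row 3 = row 1 + row 2

print(det(P), independent(P))  # -1 True
print(det(Q), independent(Q))  # 0 False
```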


Recall that our aim in this subsection is to relate the matrices D and A, 
where D is the matrix of a linear transformation t : ℝⁿ → ℝⁿ with 
respect to an eigenvector basis of t, and A is the matrix of t with respect 
to the standard basis for ℝⁿ. Figure 7 shows how we can do this by using 
the transition matrix P from the eigenvector basis E to the standard basis 
for ℝⁿ, so linking together Figures 5 and 6. 


Figure 7 The transition matrix P from the eigenvector basis E of t to the 
standard basis for ℝⁿ 


The top line of the diagram shows that multiplication by D converts the 
E-coordinate representation of v to the E-coordinate representation 
of t(v): 

t(v)_E = Dv_E. (3) 

The diagram also shows that this change can be achieved in another way, 
in three steps, highlighted in Figure 8. 

1. Use the transition matrix P to convert the E-coordinate 
   representation of v to the standard coordinate representation of v: 

   v = Pv_E. 

2. Multiply v on the left by the matrix A to obtain the standard coordinate 
   representation of t(v): 

   t(v) = Av = APv_E. 

3. Use the matrix P⁻¹ to convert the standard coordinate representation 
   of t(v) to the E-coordinate representation of t(v): 

   t(v)_E = P⁻¹t(v) = P⁻¹APv_E. 


Comparing this last equation with equation (3), we see that D, A and P 
are related by the equation 


D = P⁻¹AP. 


Thus we have proved the following result. 


Theorem C62 


Let t : ℝⁿ → ℝⁿ be a linear transformation and let E be an 
eigenvector basis of t. Let A be the matrix of t with respect to the 
standard basis for ℝⁿ, let D be the matrix of t with respect to the 
eigenvector basis E and let P be the transition matrix from E to the 
standard basis for ℝⁿ. Then 


D = P⁻¹AP. 


In fact, Theorem C62 holds for any basis E for ℝⁿ, although D is diagonal 
only when E is an eigenvector basis. 


Since D, A, P and P⁻¹ are all square n × n matrices, we can multiply 
D = P⁻¹AP on the left by the matrix P and on the right by the matrix 
P⁻¹ to obtain the related equation 


A = PDP⁻¹. 
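
Written out in full, the step from D = P⁻¹AP to the related equation uses only the associativity of matrix multiplication and PP⁻¹ = I:

```latex
\begin{aligned}
PDP^{-1} &= P\,(P^{-1}AP)\,P^{-1} \\
         &= (PP^{-1})\,A\,(PP^{-1}) \\
         &= IAI \\
         &= A.
\end{aligned}
```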


This algebraic relationship A = PDP⁻¹ may remind you of the algebraic 
relationship 


y = g ∘ x ∘ g⁻¹ 


between conjugate permutations x and y in the symmetric group Sₙ, which 
you met in Subsection 4.1 of Unit B3. You saw in Unit C1 that the set of 
square invertible n × n matrices forms a group under multiplication, and 
here the change of basis is in some sense equivalent to the ‘renaming’ in 
permutations. The matrices D and A are conjugate matrices: we will not 
use this concept here, but you will meet this idea of conjugacy in groups 
again in Book E. 


We end this subsection by applying Theorem C62 to some examples. 


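Consider the linear transformation t : ℝ² → ℝ² given by 

t(x, y) = (x + 3y, 2x − 4y). 


In Worked Exercise C67 you saw that 

\[ A = \begin{pmatrix} 1 & 3 \\ 2 & -4 \end{pmatrix} \]

is the matrix of t with respect to the standard basis for ℝ² and that 

\[ D = \begin{pmatrix} -5 & 0 \\ 0 & 2 \end{pmatrix} \]

is the matrix of t with respect to the eigenvector basis E = {(1, −2), (3, 1)}. 


At the beginning of this subsection you saw that the transition matrix 
from the basis E to the standard basis for ℝ² is 

\[ P = \begin{pmatrix} 1 & 3 \\ -2 & 1 \end{pmatrix}. \]


Now, using Strategy C4 from Unit C1, we have 

\[ P^{-1} = \frac{1}{7}\begin{pmatrix} 1 & -3 \\ 2 & 1 \end{pmatrix} = \begin{pmatrix} \tfrac{1}{7} & -\tfrac{3}{7} \\ \tfrac{2}{7} & \tfrac{1}{7} \end{pmatrix}, \]

so 

\[ P^{-1}AP = \begin{pmatrix} \tfrac{1}{7} & -\tfrac{3}{7} \\ \tfrac{2}{7} & \tfrac{1}{7} \end{pmatrix} \begin{pmatrix} 1 & 3 \\ 2 & -4 \end{pmatrix} \begin{pmatrix} 1 & 3 \\ -2 & 1 \end{pmatrix} = \begin{pmatrix} -5 & 0 \\ 0 & 2 \end{pmatrix} = D, \]

as claimed. 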


Exercise C131 


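Use the solution to Exercise C128 to find a matrix P such that 
D = P⁻¹AP, where 

\[ A = \begin{pmatrix} 1 & -2 \\ -2 & -2 \end{pmatrix} \quad\text{and}\quad D = \begin{pmatrix} 2 & 0 \\ 0 & -3 \end{pmatrix}. \]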


2.3 Diagonalisation 


In this subsection you will consider the problem of determining when a 
matrix is diagonalisable and how to diagonalise a matrix when it is 
possible. 


Definition 


The matrix A is diagonalisable if there exists an invertible matrix P 
such that the matrix 


D = P⁻¹AP 


is diagonal. 




Clearly the matrices A, D and P must all be square matrices of the same 
size. 


If a matrix A is diagonalisable, then to diagonalise it we need to find both 
the diagonal matrix D and the invertible matrix P, since it is this 
transition matrix P that links the matrix A with the diagonal matrix D. 


One particular use of diagonalisation of matrices is to find powers of 
matrices. We saw earlier that multiplying D = P⁻¹AP on the left by P 
and on the right by P⁻¹ gives A = PDP⁻¹. Now consider powers of A: 


A² = (PDP⁻¹)(PDP⁻¹) 
   = PD(P⁻¹P)DP⁻¹ 
   = PDDP⁻¹ 
   = PD²P⁻¹, 
and, in general, we have 
Aⁿ = PDⁿP⁻¹, for n = 1, 2, .... 


This last equation is useful for calculating powers of matrices, since 
calculating the nth power of a diagonal matrix is particularly simple: you 
need to find only the nth power of each diagonal entry. But first we need 
to be able to find both D and P (from which we can find P⁻¹). 
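
The formula for powers can be checked against repeated multiplication. The Python sketch below does this for the matrix A with rows (1, 3), (2, −4), using P with columns (1, −2), (3, 1), its inverse (1/7) times the matrix with rows (1, −3), (2, 1), and D with diagonal entries −5 and 2. The exponent n = 4 is an arbitrary illustrative choice and `matmul` is a helper introduced here.

```python
from fractions import Fraction

def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]

A = [[1, 3], [2, -4]]
P = [[1, 3], [-2, 1]]
P_inv = [[Fraction(1, 7), Fraction(-3, 7)],
         [Fraction(2, 7), Fraction(1, 7)]]

n = 4
# The nth power of a diagonal matrix: raise each diagonal entry to the power n.
D_n = [[(-5) ** n, 0], [0, 2 ** n]]
via_diagonal = matmul(matmul(P, D_n), P_inv)

# Direct computation of A^n by repeated multiplication, for comparison.
direct = [[1, 0], [0, 1]]
for _ in range(n):
    direct = matmul(direct, A)

print(direct)                  # [[103, -261], [-174, 538]]
print(via_diagonal == direct)  # True
```

For small n the saving is slight, but the diagonal route needs only two matrix multiplications however large n is.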


Exercise C132 


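(a) Write down D⁵, where D = \begin{pmatrix} 2 & 0 \\ 0 & -3 \end{pmatrix}. 


(b) Calculate A⁵, where A = \begin{pmatrix} 1 & -2 \\ -2 & -2 \end{pmatrix}. 


(In Exercise C131 you found that P = \begin{pmatrix} -2 & 1 \\ 1 & 2 \end{pmatrix} satisfies 
D = P⁻¹AP.) 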


If A is any n × n matrix, then we can define a linear transformation t as: 
t : ℝⁿ → ℝⁿ 
    v ↦ Av. 


In Section 1 we said that v is an eigenvector of A with corresponding 
eigenvalue λ if Av = t(v) = λv; that is, if v is an eigenvector of t. 


Definition 
Let A be an n × n matrix and let E = {e₁, e₂, ..., eₙ} be a basis 
for ℝⁿ consisting of eigenvectors of A. The basis E is an eigenvector 
basis of A. 


Thus E is an eigenvector basis of A if E is an eigenvector basis of t. 




Worked Exercise C68 


Find an eigenvector basis of the matrix 


A=(5 A 


Suppose that E is an eigenvector basis of the n × n matrix A; that is, E is 
an eigenvector basis of the linear transformation t : ℝⁿ → ℝⁿ given by 


t(v) = Av. 


It follows from Theorems C59 and C62 that if P is the transition matrix 
from the basis E to the standard basis for ℝⁿ, then 


D = P⁻¹AP 


is diagonal; that is, A is diagonalisable. This gives the following strategy 
for diagonalising a matrix, when this is possible. 


Strategy C20 

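To diagonalise an n × n matrix A: 

1. find all the eigenvalues of A 

2. find (if possible) an eigenvector basis E = {e₁, e₂, ..., eₙ} of A 

3. write down the transition matrix P whose jth column is formed 
   from the standard coordinates of eⱼ. 


Then 

\[ P^{-1}AP = D = \begin{pmatrix} \lambda_1 & 0 & \cdots & 0 \\ 0 & \lambda_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \lambda_n \end{pmatrix}, \]

where λⱼ is the eigenvalue corresponding to the eigenvector eⱼ. 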


The order of the eigenvalues down the diagonal of D must match the order 
of the eigenvectors in the basis E used to construct the transition 

matrix P. When asked to diagonalise a matrix, it is not enough to write 
down a diagonal matrix containing the eigenvalues: you must also give the 
transition matrix P. 
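
The importance of matching the order of the eigenvalues to the order of the eigenvectors can be seen in a small Python sketch. To avoid computing an inverse, it checks the equivalent equation AP = PD (equivalent to D = P⁻¹AP because P is invertible). The matrix A has rows (1, 3), (2, −4), with eigenvectors (1, −2) and (3, 1) for the eigenvalues −5 and 2; `matmul` is a helper introduced here.

```python
def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))]
            for i in range(len(X))]

A = [[1, 3], [2, -4]]

# Eigenvectors in the order (1, -2), (3, 1): eigenvalues -5 then 2.
P1 = [[1, 3], [-2, 1]]
D1 = [[-5, 0], [0, 2]]

# Eigenvectors in the order (3, 1), (1, -2): eigenvalues 2 then -5.
P2 = [[3, 1], [1, -2]]
D2 = [[2, 0], [0, -5]]

print(matmul(A, P1) == matmul(P1, D1))  # True: orders match
print(matmul(A, P2) == matmul(P2, D2))  # True: orders match
print(matmul(A, P1) == matmul(P1, D2))  # False: orders do not match
```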


The complexity involved in finding an eigenvector basis of A in step 2 of 
Strategy C20 depends on the matrix A. In Worked Exercise C68 we 
formed an eigenvector basis of A by taking one eigenvector corresponding 
to each eigenvalue, ensuring that the eigenvectors were linearly 
independent. In general, we have the following result, which we will prove 
at the end of this subsection after looking at how it can be used. This 
result means that any eigenvector can be chosen for each (distinct) 
eigenvalue and there is no need to check that they are linearly independent. 


Theorem C63 


Let A be an n × n matrix with distinct eigenvalues λ₁, λ₂, ..., λₙ and 
corresponding eigenvectors e₁, e₂, ..., eₙ. Then E = {e₁, e₂, ..., eₙ} is 
an eigenvector basis of A. 


We give an example of how Theorem C63 can be used. 


Worked Exercise C69 


Diagonalise the matrix 




It follows from Theorem C63 that we can form an eigenvector basis 
of A by taking one eigenvector corresponding to each of the three 
distinct eigenvalues. For example, 


E = {(1, 2, 1), (0, 1, 0), (1, 4, −1)} 
is an eigenvector basis of A. 


We use the eigenvectors in E to form the columns of the transition 
matrix: 

\[ P = \begin{pmatrix} 1 & 0 & 1 \\ 2 & 1 & 4 \\ 1 & 0 & -1 \end{pmatrix}. \]

Remember that the eigenvalues in D must appear in the same 
order as the corresponding eigenvectors in P. 


We use the eigenvalues corresponding to the eigenvectors in E to form 
the diagonal matrix: 


If the eigenvectors had been chosen in a different order, then the 
order of the columns of the transition matrix P and the order of the 
diagonal entries of the resulting matrix D would have been different. 


In addition, other transition matrices arise from using different 
eigenvectors for the eigenvector basis. 


Another solution is 


20.50) (Le eel 
P'AP. D orol where P=|2 —4 2 
GU g OT 


Both the order of the eigenvalues, and the eigenvectors chosen for the 
columns of P, differ here. 


Exercise C133 


Diagonalise the matrix 


\[ A = \begin{pmatrix} 4 & 2 & 0 \\ 2 & 3 & 2 \\ 0 & 2 & 2 \end{pmatrix}. \]


(In Exercise C120 you found that the eigenvectors of A are the non-zero 
vectors (2k, 2k, k), (−2k, k, 2k) and (k, −2k, 2k), corresponding to the 
eigenvalues λ = 6, λ = 3 and λ = 0, respectively.) 


It may be possible to find an eigenvector basis of an n × n matrix A even 
when A does not have n distinct eigenvalues. 


Strategy C21 

To find an eigenvector basis of an n × n matrix A: 

1. find a basis for each eigenspace of A 

2. form the set E of all the basis vectors found in step 1. 


If there are n vectors in E, then E is an eigenvector basis of A; 
otherwise E is not a basis. 


The fact that E, as found in Strategy C21, is an eigenvector basis of A if 
and only if there are n vectors in E, can be proved in a similar way to 
Theorem C63, but the details are more complicated. 
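
Strategy C21 can be sketched in Python as follows. This is an illustration under stated assumptions: the eigenvalues are supplied by the caller rather than computed, each eigenspace basis is read off from the reduced row echelon form of A − λI, and the names `null_space_basis` and `eigenvector_basis` are hypothetical. The symmetric matrix `S` is a made-up example with eigenvalues 4 and 1 (the latter repeated); `shear` is the matrix with rows (1, 1), (0, 1).

```python
from fractions import Fraction

def null_space_basis(M):
    """Basis for {v : M v = 0}, read off from the reduced row echelon form."""
    rows = [[Fraction(x) for x in row] for row in M]
    n_cols = len(rows[0])
    pivots = []  # pivot column of each pivot row, in order
    r = 0
    for c in range(n_cols):
        p = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if p is None:
            continue
        rows[r], rows[p] = rows[p], rows[r]
        rows[r] = [x / rows[r][c] for x in rows[r]]
        for i in range(len(rows)):
            if i != r:
                rows[i] = [a - rows[i][c] * b for a, b in zip(rows[i], rows[r])]
        pivots.append(c)
        r += 1
        if r == len(rows):
            break
    basis = []
    for free in (c for c in range(n_cols) if c not in pivots):
        # Set one free variable to 1, the others to 0, and solve for the rest.
        v = [Fraction(0)] * n_cols
        v[free] = Fraction(1)
        for row_index, pc in enumerate(pivots):
            v[pc] = -rows[row_index][free]
        basis.append(v)
    return basis

def eigenvector_basis(A, eigenvalues):
    """Collect a basis of each eigenspace; return None if fewer than n vectors."""
    n = len(A)
    E = []
    for lam in eigenvalues:
        shifted = [[A[i][j] - (lam if i == j else 0) for j in range(n)]
                   for i in range(n)]
        E.extend(null_space_basis(shifted))   # step 1, then step 2
    return E if len(E) == n else None

S = [[2, 1, 1], [1, 2, 1], [1, 1, 2]]   # hypothetical; eigenvalues 4 and 1
shear = [[1, 1], [0, 1]]                # only eigenvalue is 1

basis_S = eigenvector_basis(S, [4, 1])
print([[int(x) for x in v] for v in basis_S])  # [[1, 1, 1], [-1, 1, 0], [-1, 0, 1]]
print(eigenvector_basis(shear, [1]))           # None
```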


Worked Exercise C70 


Diagonalise the matrix 


\[ A = \begin{pmatrix} 4 & 2 & 2 \\ 2 & 4 & 2 \\ 2 & 2 & 4 \end{pmatrix}. \]




Subtracting the second equation from the first, we obtain 
−6x + 6y = 0, which implies that x = y. Substituting this into 
the third equation, we obtain 4x − 4z = 0, which implies that 
x = z. 


Thus S(8) = {(k, k, k) : k ∈ ℝ}. 

λ = 2  All three eigenvector equations become 
2x + 2y + 2z = 0, 
that is, x + y + z = 0, so z = −(x + y). 
Thus S(2) = {(k, l, −(k + l)) : k, l ∈ ℝ}. 


Any vector in S(8) can be written as k(1, 1, 1), and any vector in 
S(2) can be written as k(1, 0, −1) + l(0, 1, −1). 


A basis for S(8) is {(1, 1, 1)} and a basis for S(2) is 
{(1, 0, −1), (0, 1, −1)}. The set 

E = {(1, 1, 1), (1, 0, −1), (0, 1, −1)} 
contains three vectors, so it is an eigenvector basis of A. 


Note that Strategy C21 does not require us to prove linear 
independence of the vectors in E: combining the bases of the 
eigenspaces S(2) and S(8) gives a set of linearly independent 
vectors. 


We use the eigenvectors in E to form the columns of the transition 
matrix: 

\[ P = \begin{pmatrix} 1 & 1 & 0 \\ 1 & 0 & 1 \\ 1 & -1 & -1 \end{pmatrix}. \]


We use the eigenvalues corresponding to the eigenvectors in E to form 
the diagonal matrix: 

\[ P^{-1}AP = D = \begin{pmatrix} 8 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 2 \end{pmatrix}. \]


Exercise C134 


Diagonalise the matrix 


\[ A = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 2 & 1 \\ 0 & 1 & 2 \end{pmatrix}. \]




If the matrix A does not have an eigenvector basis, then these methods 
cannot be applied and the matrix A is not diagonalisable — there is no 
transition matrix. For example, in Exercise C124 you saw that all the 
eigenvectors of the matrix 


\[ A = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} \]


are non-zero vectors of the form (k,0). Any two eigenvectors of A are 
linearly dependent, so there is no eigenvector basis. Thus there is no 
transition matrix and A is not diagonalisable. 
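
For this particular matrix there is also a short direct argument that no invertible matrix P can work. It is offered as a sketch; the symbols d₁, d₂ and p₁, p₂ are introduced here for the diagonal entries of D and the columns of P.

```latex
\begin{aligned}
&\text{Suppose } P^{-1}AP = D = \begin{pmatrix} d_1 & 0 \\ 0 & d_2 \end{pmatrix}
  \text{ with } A = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}. \\
&\text{Then } AP = PD, \text{ so } A\mathbf{p}_j = d_j\mathbf{p}_j \ (j = 1, 2),
  \text{ where } \mathbf{p}_j \neq \mathbf{0} \text{ is column } j \text{ of } P. \\
&\text{The only eigenvalue of } A \text{ is } 1, \text{ so } d_1 = d_2 = 1 \text{ and } D = I. \\
&\text{Hence } A = PDP^{-1} = PIP^{-1} = I, \text{ which is false.}
\end{aligned}
```

The columns of P are non-zero because P is invertible, which is what makes each dⱼ an eigenvalue of A.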


Similarly the matrix 


\[ B = \begin{pmatrix} 1 & -1 & 0 \\ 1 & 4 & 1 \\ -1 & 1 & 4 \end{pmatrix} \]


from Exercise C125 is also not diagonalisable. The eigenvectors 
corresponding to the eigenvalue λ = 2 of multiplicity 2 are the non-zero 
vectors of the form (k, −k, k), so any two eigenvectors of B in S(2) are 
linearly dependent. The other eigenvalue λ = 5 has multiplicity 1. As 
stated at the end of Section 1, the dimension of an eigenspace cannot 
exceed the multiplicity of the corresponding eigenvalue, and so there 
cannot be two linearly independent eigenvectors corresponding to the 
eigenvalue λ = 5. 


Therefore there is no set of three linearly independent eigenvectors and 
thus no eigenvector basis; there is no transition matrix and thus B is not 
diagonalisable. 


We have shown that, if the matrix A of a linear transformation t has an 
eigenvector basis, then using this basis for both the domain and codomain 
results in a matrix of t that is a diagonal matrix. On the other hand, if 
there is an eigenvalue of multiplicity m for which there are fewer than m 
linearly independent eigenvectors, then there is no eigenvector basis and 
matrix A is not diagonalisable. 


We end this section by proving Theorem C63 as promised. 


Theorem C63 

Let A be an n × n matrix with distinct eigenvalues λ₁, λ₂, ..., λₙ and 
corresponding eigenvectors e₁, e₂, ..., eₙ. Then E = {e₁, e₂, ..., eₙ} is 
an eigenvector basis of A. 


Proof Let A and E be as in the statement of the theorem. 


Since any linearly independent set of n vectors in ℝⁿ is a basis for ℝⁿ, 
by Theorem C25 in Unit C2, we need show only that E is linearly 
independent. To do this, we assume that E is linearly dependent and 
obtain a contradiction. 


If E is linearly independent, then E must be an eigenvector basis of A. 




Suppose to the contrary that E is linearly dependent. Then we can take 
the smallest value of m (2 ≤ m ≤ n) for which a set of m vectors in E is 
linearly dependent. By relabelling the eigenvectors (if necessary), we can 
write 


α₁e₁ + α₂e₂ + ⋯ + αₘeₘ = 0, (4) 
with α₁ ≠ 0, α₂ ≠ 0, ..., αₘ ≠ 0. 
Multiplying both sides of equation (4) by the matrix A, we obtain 

A(α₁e₁ + α₂e₂ + ⋯ + αₘeₘ) = A0, 
that is, 


α₁Ae₁ + α₂Ae₂ + ⋯ + αₘAeₘ = 0. 


Now, e₁, e₂, ..., eₘ are eigenvectors of A with corresponding eigenvalues 
λ₁, λ₂, ..., λₘ, so Aeⱼ = λⱼeⱼ and 
α₁λ₁e₁ + α₂λ₂e₂ + ⋯ + αₘλₘeₘ = 0. (5) 


We now eliminate the vector eₘ. To do this, we multiply equation (4) by 
λₘ and subtract the result from equation (5): 


α₁(λ₁ − λₘ)e₁ + α₂(λ₂ − λₘ)e₂ + ⋯ + αₘ₋₁(λₘ₋₁ − λₘ)eₘ₋₁ = 0. 


Since the eigenvalues λ₁, λ₂, ..., λₘ are distinct, and none of the numbers 
α₁, α₂, ..., αₘ₋₁ is zero, we deduce that the set of m − 1 vectors 
{e₁, e₂, ..., eₘ₋₁} is linearly dependent. This, however, is impossible since 
we assumed that m is the smallest number such that a set of m vectors in 
E is linearly dependent. This contradiction establishes the result. ∎ 


3 Symmetric matrices 


In this section you will concentrate on diagonalising symmetric matrices. 
You will see that such matrices are always diagonalisable and that their 
transition matrices can be chosen to have particular properties. 


3.1 Diagonalising symmetric matrices 


Suppose that A is an n × n matrix and that we can find a basis 
{e₁, e₂, ..., eₙ} for ℝⁿ consisting of eigenvectors of A. In Section 2 you saw 
that A can be diagonalised: if P is the transition matrix whose columns 
are formed from the coordinates of the eigenvectors e₁, e₂, ..., eₙ, then 


P⁻¹AP 
is a diagonal matrix. 


In this section you will see that whenever A is an n × n symmetric matrix 
(a matrix where Aᵀ = A), we can always find a basis for ℝⁿ made up 
of eigenvectors of A, and so such a matrix is always diagonalisable. In fact, 
we can always find an orthonormal basis for ℝⁿ made up of eigenvectors 
of A. Recall from Subsection 5.4 of Unit C2 that an orthonormal basis 
consists of mutually perpendicular (orthogonal) vectors of magnitude 1. 
For example, the standard basis for ℝⁿ is an orthonormal basis. 


When we have an orthonormal basis, it turns out that the inverse of the 
transition matrix P is actually the transpose of P; that is, P⁻¹ = Pᵀ. This 
can be useful since finding the transpose of a matrix is much simpler than 
finding the inverse. We will prove this result as Theorem C65 in the next 
subsection where you will also see that orthogonal matrices have other 
useful properties. 


For example, consider the symmetric matrix 


\[ A = \begin{pmatrix} 4 & 2 & 0 \\ 2 & 3 & 2 \\ 0 & 2 & 2 \end{pmatrix}. \]


We will show that there is an orthonormal basis for ℝ³ that consists of 
eigenvectors of A. 


You found in Exercise C120 that the eigenvalues of A are λ = 6, λ = 3 and 
λ = 0, and that the eigenvectors are the non-zero vectors of the following 
forms: 


(2k, 2k, k), corresponding to λ = 6, 
(−2k, k, 2k), corresponding to λ = 3, 
(k, −2k, 2k), corresponding to λ = 0. 


Exercise C135 


Let v₁ = (2k, 2k, k), v₂ = (−2l, l, 2l) and v₃ = (m, −2m, 2m), where k, l, m 
are positive real numbers. 


(a) Show that {v₁, v₂, v₃} is an orthogonal basis for ℝ³. 


(b) Find values of k, l and m for which |v₁| = |v₂| = |v₃| = 1. 


In Subsection 5.4 of Unit C2 you saw that {v₁, v₂, ..., vₙ} is an 
orthonormal basis for ℝⁿ if vᵢ · vⱼ = 0 for i ≠ j, and |vᵢ| = 1 for each i. It 
follows from Exercise C135 that 


E = {(2/3, 2/3, 1/3), (−2/3, 1/3, 2/3), (1/3, −2/3, 2/3)} 
is an orthonormal basis for ℝ³. Since E is an eigenvector basis of A, we 
say that E is an orthonormal eigenvector basis of A. 
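
The two conditions for an orthonormal basis are easy to test in exact arithmetic. The Python sketch below checks them for the vectors (2/3, 2/3, 1/3), (−2/3, 1/3, 2/3) and (1/3, −2/3, 2/3); the function names are illustrative. It tests |v|² = 1 rather than |v| = 1, which is equivalent and avoids square roots.

```python
from fractions import Fraction as F

E = [(F(2, 3), F(2, 3), F(1, 3)),
     (F(-2, 3), F(1, 3), F(2, 3)),
     (F(1, 3), F(-2, 3), F(2, 3))]

def dot(u, v):
    """Dot product of two vectors of the same length."""
    return sum(a * b for a, b in zip(u, v))

def is_orthonormal(vectors):
    """True if v_i . v_j = 0 for i != j and v_i . v_i = 1 for each i."""
    return all(dot(u, v) == (1 if i == j else 0)
               for i, u in enumerate(vectors)
               for j, v in enumerate(vectors))

# The same directions without scaling: orthogonal, but not of magnitude 1.
unscaled = [(2, 2, 1), (-2, 1, 2), (1, -2, 2)]

print(is_orthonormal(E))         # True
print(is_orthonormal(unscaled))  # False
```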


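Following Strategy C20, we diagonalise the matrix A by writing down the 
transition matrix P whose columns are formed from the standard 
coordinates of the vectors in E: 

\[ P = \begin{pmatrix} \tfrac{2}{3} & -\tfrac{2}{3} & \tfrac{1}{3} \\ \tfrac{2}{3} & \tfrac{1}{3} & -\tfrac{2}{3} \\ \tfrac{1}{3} & \tfrac{2}{3} & \tfrac{2}{3} \end{pmatrix}. \]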



A transition matrix formed from an orthonormal eigenvector basis in this 
way is called an orthogonal matrix. 


Definition 


An $n \times n$ matrix whose columns form an orthonormal basis for $\mathbb{R}^n$ is
an orthogonal matrix.


It is important to remember that the columns of an orthogonal matrix are 
orthonormal vectors, not just orthogonal vectors, despite the name! 


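Consider the $2 \times 2$ matrix

\[ \mathbf{A} = \begin{pmatrix} \tfrac{1}{\sqrt{2}} & \tfrac{1}{\sqrt{2}} \\ \tfrac{1}{\sqrt{2}} & -\tfrac{1}{\sqrt{2}} \end{pmatrix}. \]

The columns of $\mathbf{A}$ (as vectors) are orthogonal since

\[ \left(\tfrac{1}{\sqrt{2}}, \tfrac{1}{\sqrt{2}}\right) \cdot \left(\tfrac{1}{\sqrt{2}}, -\tfrac{1}{\sqrt{2}}\right) = 0. \]

Orthogonal vectors are linearly independent, so the columns of $\mathbf{A}$ form a
basis for $\mathbb{R}^2$.

The columns of $\mathbf{A}$ (as vectors) also have magnitude 1 since

\[ \left(\tfrac{1}{\sqrt{2}}\right)^2 + \left(\tfrac{1}{\sqrt{2}}\right)^2 = \left(\tfrac{1}{\sqrt{2}}\right)^2 + \left(-\tfrac{1}{\sqrt{2}}\right)^2 = 1, \]

so the matrix $\mathbf{A}$ is an orthogonal matrix.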


Exercise C136 


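Show that $\mathbf{P}^T\mathbf{P} = \mathbf{I}$, where

\[ \mathbf{P} = \begin{pmatrix} \tfrac{2}{3} & -\tfrac{2}{3} & \tfrac{1}{3} \\ \tfrac{2}{3} & \tfrac{1}{3} & -\tfrac{2}{3} \\ \tfrac{1}{3} & \tfrac{2}{3} & \tfrac{2}{3} \end{pmatrix}. \]

($\mathbf{P}$ is the orthogonal matrix formed below Exercise C135.)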


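We know that if $\mathbf{P}^T\mathbf{P} = \mathbf{I}$, then $\mathbf{P}\mathbf{P}^T = \mathbf{I}$ (by Theorem C18 in Unit C1),
so for the matrix $\mathbf{P}$ in Exercise C136, $\mathbf{P}^T$ is the inverse of $\mathbf{P}$; that is,
$\mathbf{P}^T = \mathbf{P}^{-1}$. We will prove that $\mathbf{P}^T = \mathbf{P}^{-1}$ for any orthogonal matrix $\mathbf{P}$ as
Theorem C65 in the next subsection.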


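It follows from this and Strategy C20 that

\[ \mathbf{P}^T\mathbf{A}\mathbf{P} = \mathbf{P}^{-1}\mathbf{A}\mathbf{P} = \begin{pmatrix} 6 & 0 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 0 \end{pmatrix}. \]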


We say that the matrix A has been orthogonally diagonalised. 
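The orthogonal diagonalisation above can be checked with exact arithmetic. The following is only a minimal sketch, assuming the matrix $\mathbf{A}$ and the basis $E$ given earlier in this subsection; the helper names `transpose` and `matmul` are illustrative and are not part of the module text.

```python
from fractions import Fraction as Fr

def transpose(m):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]

def matmul(x, y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*y)]
            for row in x]

A = [[4, 2, 0], [2, 3, 2], [0, 2, 2]]

# The orthonormal eigenvector basis E, in the order lambda = 6, 3, 0.
E = [[Fr(2, 3), Fr(2, 3), Fr(1, 3)],
     [Fr(-2, 3), Fr(1, 3), Fr(2, 3)],
     [Fr(1, 3), Fr(-2, 3), Fr(2, 3)]]

P = transpose(E)            # the vectors of E become the columns of P
PT = transpose(P)

PTP = matmul(PT, P)         # should be the identity matrix
D = matmul(matmul(PT, A), P)  # should be diag(6, 3, 0)

print(PTP)
print(D)
```

Exact fractions are used here so that the off-diagonal entries come out as exactly zero rather than as small rounding errors.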


Definition 


The matrix $\mathbf{A}$ is orthogonally diagonalisable if there exists an
orthogonal matrix $\mathbf{P}$ such that the matrix

\[ \mathbf{D} = \mathbf{P}^T\mathbf{A}\mathbf{P} = \mathbf{P}^{-1}\mathbf{A}\mathbf{P} \]

is diagonal.


The following strategy is a modification of Strategy C20 for diagonalising a 
matrix. 


Strategy C22 

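To orthogonally diagonalise an $n \times n$ symmetric matrix $\mathbf{A}$:

1. find all the eigenvalues of $\mathbf{A}$

2. find an orthonormal eigenvector basis $E = \{\mathbf{e}_1, \mathbf{e}_2, \ldots, \mathbf{e}_n\}$ of $\mathbf{A}$

3. write down the orthogonal transition matrix $\mathbf{P}$ whose $j$th column
is formed from the standard coordinates of $\mathbf{e}_j$.

Then

\[ \mathbf{P}^T\mathbf{A}\mathbf{P} = \mathbf{D} = \begin{pmatrix} \lambda_1 & 0 & \cdots & 0 \\ 0 & \lambda_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \lambda_n \end{pmatrix}, \]

where $\lambda_j$ is the eigenvalue corresponding to the eigenvector $\mathbf{e}_j$.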


In Section 4 you will see that orthogonal diagonalisation is used for 
classifying conics and quadrics. However, if the aim is simply to 
diagonalise a symmetric matrix as opposed to orthogonally diagonalise it, 
then use Strategy C20 — this saves time and effort when an orthonormal 
basis, or equivalently an orthogonal transition matrix, is not required. It is 
always a good idea to consider carefully what a problem requires you to do 
in order to solve it in the most efficient way. 


You may have noticed that the words ‘if possible’ appear in Strategy C20, 
but not in Strategy C22. This is due to the fact that an $n \times n$ symmetric
matrix A always has an orthonormal eigenvector basis, so it must be 
orthogonally diagonalisable. It is also true that any orthogonally 
diagonalisable matrix A must be symmetric — you might like to prove this 
yourself; it is included as a ‘challenging’ exercise in the additional exercises 
booklet for this unit. 


In the case where a symmetric matrix A has n distinct eigenvalues, the 
fact that A has an orthonormal eigenvector basis follows from the 
following result. 


Figure 9 $\mathbf{v}^T\mathbf{w} = \mathbf{v} \cdot \mathbf{w}$


Theorem C64 


Eigenvectors corresponding to distinct eigenvalues of a symmetric 
matrix are orthogonal. 


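Proof Let $\mathbf{A}$ be a symmetric matrix, and let $\mathbf{v}$ and $\mathbf{w}$ be eigenvectors
of $\mathbf{A}$ corresponding to the distinct eigenvalues $\lambda$ and $\mu$. Then

\[ \mathbf{A}\mathbf{v} = \lambda\mathbf{v} \quad\text{and}\quad \mathbf{A}\mathbf{w} = \mu\mathbf{w}. \]

®. To show that $\mathbf{v}$ and $\mathbf{w}$ are orthogonal, we need to show that $\mathbf{v} \cdot \mathbf{w} = 0$.
We do this by writing $\mathbf{v}^T\mathbf{A}\mathbf{w}$ in two ways and using the fact that
$\mathbf{v}^T\mathbf{w} = \mathbf{v} \cdot \mathbf{w}$. This fact is illustrated in Figure 9. .®

We have
\[ \mathbf{v}^T\mathbf{A}\mathbf{w} = \mathbf{v}^T(\mathbf{A}\mathbf{w}) = \mathbf{v}^T(\mu\mathbf{w}) = \mu(\mathbf{v}^T\mathbf{w}) = \mu(\mathbf{v} \cdot \mathbf{w}). \]
Since $\mathbf{A}$ is symmetric, we have $\mathbf{A}^T = \mathbf{A}$, and therefore
\[ \mathbf{v}^T\mathbf{A} = \mathbf{v}^T\mathbf{A}^T = (\mathbf{A}\mathbf{v})^T. \]
It follows that
\[ \mathbf{v}^T\mathbf{A}\mathbf{w} = (\mathbf{v}^T\mathbf{A})\mathbf{w} = (\mathbf{A}\mathbf{v})^T\mathbf{w} = (\lambda\mathbf{v})^T\mathbf{w} = \lambda(\mathbf{v}^T\mathbf{w}) = \lambda(\mathbf{v} \cdot \mathbf{w}). \]
Therefore $\lambda(\mathbf{v} \cdot \mathbf{w}) = \mu(\mathbf{v} \cdot \mathbf{w})$; thus
\[ (\lambda - \mu)(\mathbf{v} \cdot \mathbf{w}) = 0. \]

Since the eigenvalues $\lambda$ and $\mu$ are distinct, $\lambda - \mu$ is non-zero, and hence
$\mathbf{v} \cdot \mathbf{w} = 0$. The two eigenvectors $\mathbf{v}$ and $\mathbf{w}$ are orthogonal as required. ■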


The following exercises show how Theorem C64 can be used. 


Worked Exercise C71 


Orthogonally diagonalise the symmetric matrix 


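\[ \mathbf{A} = \begin{pmatrix} 5 & 2 \\ 2 & 5 \end{pmatrix}. \]

Solution

The characteristic equation of $\mathbf{A}$ is $(5 - \lambda)^2 - 4 = 0$, so the eigenvalues of $\mathbf{A}$
are $\lambda = 7$ and $\lambda = 3$.

®. Since any eigenvectors corresponding to these eigenvalues are
orthogonal by Theorem C64, we form an orthonormal eigenvector
basis of $\mathbf{A}$ by taking an eigenvector of magnitude 1 corresponding to
each of the two distinct eigenvalues. .®

An eigenvector of magnitude 1 corresponding to $\lambda = 7$ is
\[ \left(\tfrac{1}{\sqrt{2}}, \tfrac{1}{\sqrt{2}}\right). \]
An eigenvector of magnitude 1 corresponding to $\lambda = 3$ is
\[ \left(\tfrac{1}{\sqrt{2}}, -\tfrac{1}{\sqrt{2}}\right). \]
It follows from Theorem C64 that an orthonormal eigenvector basis
of $\mathbf{A}$ is

\[ E = \left\{ \left(\tfrac{1}{\sqrt{2}}, \tfrac{1}{\sqrt{2}}\right), \left(\tfrac{1}{\sqrt{2}}, -\tfrac{1}{\sqrt{2}}\right) \right\}. \]

We use the eigenvectors in $E$ to form the columns of the orthogonal
transition matrix:

\[ \mathbf{P} = \begin{pmatrix} \tfrac{1}{\sqrt{2}} & \tfrac{1}{\sqrt{2}} \\ \tfrac{1}{\sqrt{2}} & -\tfrac{1}{\sqrt{2}} \end{pmatrix}. \]

We use the eigenvalues corresponding to the eigenvectors in $E$ to form
the diagonal matrix:

\[ \mathbf{P}^T\mathbf{A}\mathbf{P} = \mathbf{D} = \begin{pmatrix} 7 & 0 \\ 0 & 3 \end{pmatrix}. \]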

Exercise C137 


Orthogonally diagonalise each of the following symmetric matrices. 


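(a) $\mathbf{A} = \begin{pmatrix} 9 & -2 \\ -2 & 6 \end{pmatrix}$

(b) $\mathbf{A}$ is a $3 \times 3$ symmetric matrix with first row $(5, -1, -1)$; the remaining rows are not legible in this copy.

The eigenvalues in part (b) are $\lambda = 6$, $\lambda = 3$ and $\lambda = 2$.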




So far, in each case where we have orthogonally diagonalised an $n \times n$
symmetric matrix, we have had n distinct eigenvalues and Theorem C64 
has ensured that the eigenvectors are all orthogonal. We have then formed 
an orthonormal eigenvector basis for the matrix by writing down basis 
vectors of magnitude 1. Where the eigenvalues of the symmetric matrix 
are not all distinct we have to find an orthonormal eigenvector basis for 
each eigenspace — then Theorem C64 will ensure that the resulting set of 
eigenvectors will form an orthonormal eigenvector basis for the matrix. 


The following strategy is a modification of Strategy C21. It reflects the 
fact that we can always find an orthonormal basis comprising r vectors for 
an eigenspace of a symmetric matrix corresponding to an eigenvalue of 
multiplicity r. This result is not proved here. 


Strategy C23 

To find an orthonormal eigenvector basis of a symmetric matrix A: 
1. find an orthonormal basis for each eigenspace of A 

2. form the set E of all the basis vectors found in step 1. 


Then E is an orthonormal eigenvector basis of A. 


Worked Exercise C72 


Orthogonally diagonalise the symmetric matrix 


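\[ \mathbf{A} = \begin{pmatrix} 4 & 2 & 2 \\ 2 & 4 & 2 \\ 2 & 2 & 4 \end{pmatrix}. \]

Solution

The eigenvalues of $\mathbf{A}$ are $\lambda = 8$ and $\lambda = 2$, and the eigenspace $S(2)$ has
basis $\{(1, 0, -1), (0, 1, -1)\}$.

To find an orthogonal basis for the eigenspace $S(2)$, we use the
Gram-Schmidt orthogonalisation process.
Let the orthogonal basis we seek be $\{\mathbf{v}_1, \mathbf{v}_2\}$, with $\mathbf{v}_1 = (1, 0, -1)$.
Then
\[
\begin{aligned}
\mathbf{v}_2 &= (0, 1, -1) - \frac{\mathbf{v}_1 \cdot (0, 1, -1)}{\mathbf{v}_1 \cdot \mathbf{v}_1}\, \mathbf{v}_1 \\
&= (0, 1, -1) - \frac{(1, 0, -1) \cdot (0, 1, -1)}{(1, 0, -1) \cdot (1, 0, -1)}\, (1, 0, -1) \\
&= (0, 1, -1) - \tfrac{1}{2}(1, 0, -1) \\
&= \left(-\tfrac{1}{2}, 1, -\tfrac{1}{2}\right).
\end{aligned}
\]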


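®. Dividing $\mathbf{v}_2$ by $|\mathbf{v}_2| = \sqrt{6}/2$ gives a unit basis vector. However,
although it is not necessary it is often helpful to minimise the minus
signs involved: we can multiply through by $-1$ to get another unit
basis vector orthogonal to $\mathbf{v}_1$. .®

An orthonormal basis for $S(2)$ is therefore

\[ \left\{ \left(\tfrac{1}{\sqrt{2}}, 0, -\tfrac{1}{\sqrt{2}}\right), \left(\tfrac{1}{\sqrt{6}}, -\tfrac{2}{\sqrt{6}}, \tfrac{1}{\sqrt{6}}\right) \right\}. \]

®. We have ensured that the eigenvectors in the basis for $S(2)$ are
orthogonal, and by Theorem C64 the eigenvectors corresponding to
the distinct eigenvalues $\lambda = 8$ and $\lambda = 2$ are orthogonal. .®

By Theorem C64 an orthonormal eigenvector basis of $\mathbf{A}$ is therefore

\[ E = \left\{ \left(\tfrac{1}{\sqrt{3}}, \tfrac{1}{\sqrt{3}}, \tfrac{1}{\sqrt{3}}\right), \left(\tfrac{1}{\sqrt{2}}, 0, -\tfrac{1}{\sqrt{2}}\right), \left(\tfrac{1}{\sqrt{6}}, -\tfrac{2}{\sqrt{6}}, \tfrac{1}{\sqrt{6}}\right) \right\}. \]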


We use the eigenvectors in E to form the columns of the transition 
matrix: 


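\[ \mathbf{P} = \begin{pmatrix} \tfrac{1}{\sqrt{3}} & \tfrac{1}{\sqrt{2}} & \tfrac{1}{\sqrt{6}} \\ \tfrac{1}{\sqrt{3}} & 0 & -\tfrac{2}{\sqrt{6}} \\ \tfrac{1}{\sqrt{3}} & -\tfrac{1}{\sqrt{2}} & \tfrac{1}{\sqrt{6}} \end{pmatrix}. \]

We use the eigenvalues corresponding to the eigenvectors in $E$ to form
the diagonal matrix:

\[ \mathbf{P}^T\mathbf{A}\mathbf{P} = \mathbf{D} = \begin{pmatrix} 8 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 2 \end{pmatrix}. \]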


The diagonal matrix found here is the same as that found in Worked 
Exercise C70, since the eigenvalues are considered in the same order. The 
difference in the diagonalisation lies in the transition matrix, which in this 
case is orthogonal. 
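The Gram-Schmidt step used for the repeated eigenvalue can be sketched in code. This is a minimal illustration under stated assumptions: it takes the basis $\{(1, 0, -1), (0, 1, -1)\}$ of $S(2)$ as input, and the function names `gram_schmidt` and `normalise` are hypothetical helpers rather than anything defined in this unit. Each new vector has its components along the earlier vectors subtracted in turn, which in exact arithmetic gives the same result as the usual Gram-Schmidt formula.

```python
from fractions import Fraction
import math

def dot(u, v):
    """Scalar product of two vectors of equal length."""
    return sum(a * b for a, b in zip(u, v))

def gram_schmidt(vectors):
    """Return an orthogonal basis for the span of linearly independent vectors."""
    basis = []
    for w in vectors:
        w = [Fraction(c) for c in w]
        for v in basis:
            coeff = dot(v, w) / dot(v, v)   # component of w along v
            w = [wc - coeff * vc for wc, vc in zip(w, v)]
        basis.append(w)
    return basis

def normalise(v):
    """Scale a non-zero vector to magnitude 1 (floating point)."""
    length = math.sqrt(float(dot(v, v)))
    return [float(c) / length for c in v]

v1, v2 = gram_schmidt([(1, 0, -1), (0, 1, -1)])
e1, e2 = normalise(v1), normalise(v2)

print(v2)   # the second orthogonal vector, as exact fractions
print(e2)   # a unit vector; multiplying by -1 gives another valid choice
```

The unit vector obtained from $\mathbf{v}_2$ here is the negative of the one used in the worked exercise, which is an equally valid basis vector.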


Exercise C138 


Orthogonally diagonalise the symmetric matrix 


\[ \mathbf{A} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 2 & 1 \\ 0 & 1 & 2 \end{pmatrix}. \]

(In Exercise C134 you found the eigenvalues and eigenvectors of $\mathbf{A}$: that a
basis for $S(3)$ is $\{(0, 1, 1)\}$ and a basis for $S(1)$ is $\{(1, 0, 0), (0, 1, -1)\}$.)


We conclude this subsection by noting that every symmetric matrix can be 
orthogonally diagonalised and conversely that an orthogonally 
diagonalisable matrix is symmetric. However, it is possible to diagonalise 
(but not orthogonally diagonalise) a non-symmetric matrix that has an 
eigenvector basis. 


3.2 Orthogonal matrices 

In this subsection we look at some properties of orthogonal matrices. 
Remember that the columns of an orthogonal matrix form an orthonormal 
basis, not merely an orthogonal basis; that is, the columns are orthogonal 
vectors of magnitude 1. 


We have said that whenever $\mathbf{P}$ is an orthogonal matrix we have $\mathbf{P}^T = \mathbf{P}^{-1}$.
We now prove this result. 


Theorem C65 


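A square matrix $\mathbf{P}$ is orthogonal if and only if $\mathbf{P}^T = \mathbf{P}^{-1}$.

Proof We know by Theorem C18 in Unit C1 that $\mathbf{P}^T\mathbf{P} = \mathbf{I}$ if and only if
$\mathbf{P}\mathbf{P}^T = \mathbf{I}$, so $\mathbf{P}^T = \mathbf{P}^{-1}$ if and only if $\mathbf{P}^T\mathbf{P} = \mathbf{I}$.

®. So we need to show that $\mathbf{P}$ is orthogonal if and only if $\mathbf{P}^T\mathbf{P} = \mathbf{I}$. We
start off by considering the expression $\mathbf{P}^T\mathbf{P}$. .®

Let the columns of the matrix $\mathbf{P}$ be the column vectors $\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_n$.
Then the rows of the matrix $\mathbf{P}^T$ are the row vectors $\mathbf{x}_1^T, \mathbf{x}_2^T, \ldots, \mathbf{x}_n^T$.

For each $i$ and $j$, the $(i, j)$-entry of $\mathbf{P}^T\mathbf{P}$ is the scalar product of the $i$th
row of $\mathbf{P}^T$ and the $j$th column of $\mathbf{P}$; that is, $\mathbf{x}_i \cdot \mathbf{x}_j$.

So $\mathbf{P}^T\mathbf{P} = \mathbf{I}$ if and only if
\[ \mathbf{x}_i \cdot \mathbf{x}_j = 0 \text{ whenever } i \neq j \quad\text{and}\quad \mathbf{x}_i \cdot \mathbf{x}_i = 1 \text{ for each } i. \]

This is the case precisely when $\{\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_n\}$ is an orthonormal basis
for $\mathbb{R}^n$; that is, when $\mathbf{P}$ is orthogonal. ■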
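The idea in this proof, that the entries of $\mathbf{P}^T\mathbf{P}$ are the scalar products of the columns of $\mathbf{P}$, gives a direct numerical test for orthogonality. The sketch below is illustrative only: the names `gram_matrix` and `is_orthogonal` and the tolerance `tol` are assumptions introduced here, and a tolerance is needed because entries such as $1/\sqrt{2}$ are not represented exactly in floating point.

```python
import math

def gram_matrix(m):
    """Return P^T P: entry (i, j) is the scalar product of columns i and j."""
    cols = [list(col) for col in zip(*m)]
    return [[sum(a * b for a, b in zip(ci, cj)) for cj in cols] for ci in cols]

def is_orthogonal(m, tol=1e-12):
    """True if the square matrix m satisfies P^T P = I to within tol."""
    n = len(m)
    if any(len(row) != n for row in m):
        return False
    g = gram_matrix(m)
    return all(abs(g[i][j] - (1.0 if i == j else 0.0)) <= tol
               for i in range(n) for j in range(n))

s = 1 / math.sqrt(2)
orthonormal_columns = is_orthogonal([[s, s], [s, -s]])
orthogonal_only = is_orthogonal([[1, 1], [1, -1]])

print(orthonormal_columns)   # True: the columns are orthogonal unit vectors
print(orthogonal_only)       # False: the columns are orthogonal but have magnitude sqrt(2)
```

The second example shows why the columns of an orthogonal matrix must be orthonormal, not merely orthogonal.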


Several properties of orthogonal matrices follow from Theorem C65. 


Corollary C66 
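Let $\mathbf{P}$ and $\mathbf{Q}$ be orthogonal $n \times n$ matrices. Then:
(a) $\mathbf{P}^{-1}$ $(= \mathbf{P}^T)$ is orthogonal
(b) the rows of $\mathbf{P}$ form an orthonormal basis for $\mathbb{R}^n$
(c) $\det \mathbf{P} = \pm 1$
(d) the product $\mathbf{P}\mathbf{Q}$ is orthogonal.

Proof

(a) ®. To show that $\mathbf{P}^{-1}$ is orthogonal we must show that the
transpose of $\mathbf{P}^{-1}$ is the inverse of $\mathbf{P}^{-1}$. .®

By Theorem C65 we have $\mathbf{P}^T = \mathbf{P}^{-1}$. Now,
\[ (\mathbf{P}^{-1})^T\mathbf{P}^{-1} = (\mathbf{P}^T)^T\mathbf{P}^{-1} = \mathbf{P}\mathbf{P}^{-1} = \mathbf{I}. \]

Thus $(\mathbf{P}^{-1})^T = (\mathbf{P}^{-1})^{-1}$, so $\mathbf{P}^{-1}$ $(= \mathbf{P}^T)$ is orthogonal.

(b) The rows of $\mathbf{P}$ are the columns of $\mathbf{P}^T$. The matrix $\mathbf{P}^T$ is orthogonal
by part (a), so its columns form an orthonormal basis for $\mathbb{R}^n$. Thus
the rows of $\mathbf{P}$ form an orthonormal basis for $\mathbb{R}^n$.

(c) We know that $\det \mathbf{P}^T = \det \mathbf{P}$, and $\mathbf{P}^T = \mathbf{P}^{-1}$ by Theorem C65, so
$\mathbf{P}^T\mathbf{P} = \mathbf{I}$.

®. By Theorem C14 in Unit C1 we know that
$\det(\mathbf{A}\mathbf{B}) = (\det \mathbf{A})(\det \mathbf{B})$ and $\det \mathbf{A}^T = \det \mathbf{A}$ for square matrices $\mathbf{A}$
and $\mathbf{B}$ of the same size. .®

Now,
\[ \det(\mathbf{P}^T\mathbf{P}) = (\det \mathbf{P}^T)(\det \mathbf{P}) = (\det \mathbf{P})^2, \]
but
\[ \det(\mathbf{P}^T\mathbf{P}) = \det \mathbf{I} = 1, \]
so $(\det \mathbf{P})^2 = 1$. Hence $\det \mathbf{P} = \pm 1$.

(d) The proof of this is left for you to do in Exercise C139. ■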


Exercise C139 


Let $\mathbf{P}$ and $\mathbf{Q}$ be orthogonal $n \times n$ matrices. Prove that the product $\mathbf{P}\mathbf{Q}$ is
orthogonal. 


(This is part (d) of Corollary C66.) 


Figure 10 The angle $\theta$ made by the vector $(a, c)$


To understand why orthogonal diagonalisation is useful, beyond the ease
of finding the inverse of the transition matrix, we will now look at the
geometry of orthogonal transition matrices in $\mathbb{R}^2$ and $\mathbb{R}^3$.

We begin by asking to what transformations of the plane the $2 \times 2$
orthogonal matrices correspond. Suppose that

\[ \mathbf{P} = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \]

is an orthogonal matrix. Then the vectors $(a, c)$ and $(b, d)$ form an
orthonormal basis for $\mathbb{R}^2$ and $\det \mathbf{P} = \pm 1$.


We stated in Subsection 3.2 of Unit C3 that the magnitude of the
determinant of a matrix of a linear transformation gives the 'scaling
factor'. Therefore $\det \mathbf{P} = \pm 1$ means that there is no scaling; that is,
magnitudes are preserved.


Let $\theta$ be the angle that the unit vector $(a, c)$ makes with the $x$-axis, as
illustrated in Figure 10 for the case that $(a, c)$ is in the first quadrant, so

\[ (a, c) = (\cos\theta, \sin\theta). \]

Since the unit vector $(b, d)$ is orthogonal to $(a, c)$, we have $(a, c) \cdot (b, d) = 0$,
so

\[ (b, d) = (-\sin\theta, \cos\theta) \quad\text{or}\quad (\sin\theta, -\cos\theta), \]

as illustrated in Figure 11.

Figure 11 The two possible vectors $(b, d)$ orthogonal to the vector $(a, c)$

Hence, if $\det \mathbf{P} = +1$, then
\[ \mathbf{P} = \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}, \]
and if $\det \mathbf{P} = -1$, then
\[ \mathbf{P} = \begin{pmatrix} \cos\theta & \sin\theta \\ \sin\theta & -\cos\theta \end{pmatrix}. \]
Now suppose that $E = \{\mathbf{e}_1, \mathbf{e}_2\}$ is an orthonormal basis for $\mathbb{R}^2$ and that $\mathbf{P}$
is the orthogonal transition matrix whose columns are formed from the
coordinates of $\mathbf{e}_1$ and $\mathbf{e}_2$.

We have just seen that if $\det \mathbf{P} = +1$, then
\[ \mathbf{e}_1 = (\cos\theta, \sin\theta) \quad\text{and}\quad \mathbf{e}_2 = (-\sin\theta, \cos\theta); \]
that is, $\mathbf{e}_1$ and $\mathbf{e}_2$ are the images of the standard basis vectors $(1, 0)$ and
$(0, 1)$ under a rotation $r_\theta$, as illustrated in Figure 12.

Figure 12 A rotation $r_\theta$

Similarly, if $\det \mathbf{P} = -1$, then $\mathbf{e}_1$ and $\mathbf{e}_2$ are the images of the standard
basis vectors $(1, 0)$ and $(0, 1)$ under a reflection $q_{\theta/2}$, as illustrated in
Figure 13.

Figure 13 A reflection $q_{\theta/2}$


So if a 2 x 2 orthogonal matrix P is used to represent a linear 
transformation (as opposed to a transition matrix), then the linear 
transformation must be either a rotation or a reflection. 
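The classification of $2 \times 2$ orthogonal matrices can be sketched as a short check. This is only an illustration under stated assumptions: the function names and the tolerance `tol` are introduced here, and the test first confirms that the columns are orthonormal before using the sign of the determinant, since a determinant of $+1$ on its own does not make a matrix orthogonal.

```python
import math

def rotation(theta):
    """Matrix with columns (cos t, sin t) and (-sin t, cos t)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]

def reflection(theta):
    """Matrix with columns (cos t, sin t) and (sin t, -cos t)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s], [s, -c]]

def classify_orthogonal_2x2(m, tol=1e-9):
    """Return 'rotation', 'reflection' or 'not orthogonal' for a 2 x 2 matrix."""
    (a, b), (c, d) = m
    orthonormal = (abs(a * a + c * c - 1) <= tol      # first column has magnitude 1
                   and abs(b * b + d * d - 1) <= tol  # second column has magnitude 1
                   and abs(a * b + c * d) <= tol)     # columns are orthogonal
    if not orthonormal:
        return "not orthogonal"
    return "rotation" if a * d - b * c > 0 else "reflection"

kind_rotation = classify_orthogonal_2x2(rotation(0.7))
kind_reflection = classify_orthogonal_2x2(reflection(0.7))
kind_shear = classify_orthogonal_2x2([[1, 1], [0, 1]])

print(kind_rotation, kind_reflection, kind_shear)
```

The shear matrix in the last example has determinant $+1$ but is reported as not orthogonal, because its second column does not have magnitude 1.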


Similar arguments can be applied to $3 \times 3$ orthogonal matrices to show
that linear transformations of $\mathbb{R}^3$ whose matrices are orthogonal are
rotations about a line through the origin, reflections in a plane through the
origin or combinations of these. The orthogonal matrices representing
rotations of $\mathbb{R}^3$ are precisely those with determinant $+1$.



Exercise C140 


Consider the matrix 


\[ \mathbf{A} = \begin{pmatrix} 0 & 0 & -1 \\ 0 & 1 & 0 \\ 1 & 0 & 0 \end{pmatrix}. \]

(a) Verify that this matrix is orthogonal.

(b) Show that this matrix represents a rotation of $\mathbb{R}^3$.


Let $t$ be a linear transformation from $\mathbb{R}^n$ to $\mathbb{R}^n$ with a matrix
representation that is a symmetric matrix $\mathbf{A}$. In effect, when we
orthogonally diagonalise $\mathbf{A}$, we are finding a basis for $\mathbb{R}^n$ for which

• the matrix of $t$ is diagonal
• the basis vectors are orthogonal
• the basis vectors have magnitude 1.

For $\mathbb{R}^2$ and $\mathbb{R}^3$ this new basis is simply the standard basis rotated,
reflected or, for $\mathbb{R}^3$, a combination of the two.


4 Conics and quadrics 


In this section you will classify conics and quadrics using many of the 
techniques you have learned in this book on linear algebra, including 
orthogonal diagonalisation of symmetric matrices. 


You revised conics in Unit A4 Real functions, graphs and conics. 


4.1 Classifying conics 


A non-degenerate conic may be a circle, an ellipse, a parabola or a 
hyperbola. It is said to be in standard position if it is positioned in the 
plane as follows. 


• For a circle: its centre is at the origin.

• For an ellipse: its axes of symmetry are the $x$- and $y$-axes, and its
largest width is along the $x$-axis.

• For a parabola: its axis of symmetry is the $x$-axis, it passes through the
origin and its other points lie to the right of the origin.

• For a hyperbola: its axes of symmetry are the $x$- and $y$-axes, and it
crosses the $x$-axis.


A circle may sometimes be considered to be a special type of ellipse, and 
that will be the case throughout this section. 


An ellipse, a parabola and a hyperbola in standard position are illustrated 
in Figure 14. 


Figure 14 Conics in standard position: (a) ellipse, (b) parabola and
(c) hyperbola


The line joining the vertices of an ellipse is the major axis of the ellipse, 
and the line perpendicular to this through the centre of the ellipse is the 
minor axis of the ellipse. Thus, for an ellipse in standard position, the 
major and minor axes are the x-axis and y-axis, respectively. 


We can define major and minor axes for parabolas and hyperbolas 
similarly. 
• For a parabola, the major axis is the axis of the parabola, and the minor
axis is the line perpendicular to this through the vertex of the parabola.

• For a hyperbola, the major axis is the line joining the vertices of the
hyperbola, and the minor axis is the line perpendicular to this through
the centre of the hyperbola.

Notice that in each case the minor axis is parallel to the directrix of the conic.
(You met the directrix of a conic in Section 5 of Unit A4.)


In this way, the major and minor axes of any conic in standard position 
are the x-axis and y-axis, respectively. 


An ellipse in standard position has equation

\[ \frac{x^2}{a^2} + \frac{y^2}{b^2} = 1, \]

a parabola in standard position has equation

\[ y^2 = 4ax \]

and a hyperbola in standard position has equation

\[ \frac{x^2}{a^2} - \frac{y^2}{b^2} = 1. \]

Figure 15 Moving the axes to be able to recognise a conic

Theorem A21 in Unit A4 says that any conic in $\mathbb{R}^2$ is the set of
points $(x, y)$ in $\mathbb{R}^2$ that satisfy an equation of the following form

\[ Ax^2 + Bxy + Cy^2 + Fx + Gy + H = 0, \tag{6} \]

where $A$, $B$, $C$, $F$, $G$ and $H$ are real numbers, and $A$, $B$ and $C$ are not all
zero. This theorem also says the converse: that the set of all points in $\mathbb{R}^2$
whose coordinates $(x, y)$ satisfy an equation of this form is a conic.
However, such a conic may be degenerate; in this subsection we will only
be concerned with non-degenerate conics.

Given the equation of a non-degenerate conic, such as

\[ x^2 - 4xy - 2y^2 + 6x + 12y + 21 = 0, \tag{7} \]

we would like to be able to decide whether it represents an ellipse, a
hyperbola or a parabola. We know it is not a circle because of the non-zero
term in $xy$, but it is too complicated to easily determine more than this.
Generally, the equations of conics that arise in calculations are not in
standard position: thus we need some way of determining the nature of a
conic from its equation.


In fact, equation (7) represents a hyperbola with centre (1,2), major axis 
y = 2x and minor axis x + 2y = 5. This conic would be easily recognisable 
were we to move the axes of the plane so that they pass through the centre 
and line up with the major and minor axes of the conic, as illustrated in 
Figure 15. 


You will see that we can move the axes of the plane by introducing 
matrices and changing the basis for the plane, then performing a 
translation so that the conic is in standard position with respect to these 
new basis vectors. The conic will then be easily recognisable from its 
equation. 


We will actually be a little less specific with how we move the axes 
mathematically and may not always end up with a conic in standard 
position: the axes may be interchanged or pointing in the opposite 
directions resulting in conics that are reflected or rotated. However, in 
every case the axes will align with the major and minor axes of the conic, 
and the equation will resemble the equation of a conic in standard 
position; we say that such an equation of a conic is in standard form. 


An ellipse and a hyperbola with equations in standard form, but that are 
not in standard position, are illustrated in Figure 16. 


Figure 16 Conics not in standard position with equations in standard form:
(a) ellipse and (b) hyperbola

Parabolas with equations in standard form, but not in standard position, 
are illustrated in Figure 17. 


Figure 17 Parabolas not in standard position with equations in standard
form: $x^2 = 4ay$ with $a > 0$; $y^2 = 4ax$ with $a < 0$; $x^2 = 4ay$ with $a < 0$
Introducing matrices 


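We first write equation (6), $Ax^2 + Bxy + Cy^2 + Fx + Gy + H = 0$, using
matrices and vectors; that is, in matrix form as

\[ \mathbf{x}^T\mathbf{A}\mathbf{x} + \mathbf{J}^T\mathbf{x} + H = 0, \tag{8} \]

where

\[ \mathbf{A} = \begin{pmatrix} A & \tfrac{1}{2}B \\ \tfrac{1}{2}B & C \end{pmatrix}, \quad \mathbf{J} = \begin{pmatrix} F \\ G \end{pmatrix} \quad\text{and}\quad \mathbf{x} = \begin{pmatrix} x \\ y \end{pmatrix}. \]

This is possible, since

\[
\begin{aligned}
\mathbf{x}^T\mathbf{A}\mathbf{x} &= \begin{pmatrix} x & y \end{pmatrix} \begin{pmatrix} A & \tfrac{1}{2}B \\ \tfrac{1}{2}B & C \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} \\
&= \begin{pmatrix} x & y \end{pmatrix} \begin{pmatrix} Ax + \tfrac{1}{2}By \\ \tfrac{1}{2}Bx + Cy \end{pmatrix} \\
&= Ax^2 + Bxy + Cy^2
\end{aligned}
\]

and

\[ \mathbf{J}^T\mathbf{x} = \begin{pmatrix} F & G \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} = Fx + Gy. \]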




Notice that the matrix A is symmetric; this will be important. 
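As a quick check of the matrix form, the following sketch builds the matrix and vector from the six coefficients of a conic and confirms that the matrix expression agrees with the original polynomial at a few sample points. It is a minimal illustration, assuming integer or rational coefficients; the function names are hypothetical and exact fractions are used so that the comparison is exact.

```python
from fractions import Fraction

def conic_matrices(A, B, C, F, G, H):
    """Return (matrix, J, H) for the conic Ax^2 + Bxy + Cy^2 + Fx + Gy + H = 0."""
    half_B = Fraction(B) / 2          # the xy-coefficient is split across the off-diagonal
    matrix = [[Fraction(A), half_B], [half_B, Fraction(C)]]
    J = [Fraction(F), Fraction(G)]
    return matrix, J, Fraction(H)

def matrix_form(matrix, J, H, x, y):
    """Evaluate x^T A x + J^T x + H at the point (x, y)."""
    Ax = matrix[0][0] * x + matrix[0][1] * y
    Ay = matrix[1][0] * x + matrix[1][1] * y
    return x * Ax + y * Ay + J[0] * x + J[1] * y + H

def polynomial_form(A, B, C, F, G, H, x, y):
    """Evaluate Ax^2 + Bxy + Cy^2 + Fx + Gy + H at the point (x, y)."""
    return A * x * x + B * x * y + C * y * y + F * x + G * y + H

coeffs = (1, -4, -2, 6, 12, 21)       # the conic x^2 - 4xy - 2y^2 + 6x + 12y + 21 = 0
A7, J7, H7 = conic_matrices(*coeffs)
samples = [(0, 0), (1, 2), (-3, 5), (Fraction(1, 2), Fraction(-7, 3))]
agree = all(matrix_form(A7, J7, H7, x, y) == polynomial_form(*coeffs, x, y)
            for x, y in samples)

print(A7, J7, H7)
print(agree)
```

The matrix produced is symmetric by construction, because the same value $\tfrac{1}{2}B$ is placed in both off-diagonal positions.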


For example, the conic with equation (7) can be written in matrix form (8) 
with 


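\[ \mathbf{A} = \begin{pmatrix} 1 & -2 \\ -2 & -2 \end{pmatrix}, \quad \mathbf{J} = \begin{pmatrix} 6 \\ 12 \end{pmatrix} \quad\text{and}\quad H = 21. \]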


Exercise C141 


For each of the following equations of a conic in standard position, write
the equation in matrix form and specify the matrices $\mathbf{A}$ and $\mathbf{J}$.

(a) the ellipse $\dfrac{x^2}{a^2} + \dfrac{y^2}{b^2} = 1$

(b) the hyperbola $\dfrac{x^2}{a^2} - \dfrac{y^2}{b^2} = 1$

(c) the parabola $y^2 = 4ax$


Aligning the axes 


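The matrix $\mathbf{A}$ in the matrix representation (8) is symmetric, so we know
that we can orthogonally diagonalise this matrix to get $\mathbf{P}^T\mathbf{A}\mathbf{P} = \mathbf{D}$,
where $\mathbf{P}$ is an orthogonal transition matrix.

This helps us recognise the conic by aligning the basis vectors with the
axes of the conic and therefore removing the $xy$-terms from the equation.
The columns of $\mathbf{P}$ form an orthonormal basis $E$, and $\mathbf{P}$ changes
$E$-coordinates $\mathbf{x}_E$, which we will write in the form $\mathbf{x}' = (x', y')$, into
standard coordinates $\mathbf{x} = (x, y)$, so that $\mathbf{x} = \mathbf{P}\mathbf{x}'$.

In this way equation (8) becomes
\[ (\mathbf{P}\mathbf{x}')^T\mathbf{A}(\mathbf{P}\mathbf{x}') + \mathbf{J}^T\mathbf{P}\mathbf{x}' + H = 0, \]
which can be rewritten as
\[ (\mathbf{x}')^T(\mathbf{P}^T\mathbf{A}\mathbf{P})\mathbf{x}' + \mathbf{J}^T\mathbf{P}\mathbf{x}' + H = 0. \tag{9} \]

Now, $\mathbf{P}^T\mathbf{A}\mathbf{P} = \mathbf{D}$ is a diagonal matrix with diagonal entries $\lambda_1$ and $\lambda_2$, so
we have

\[
\begin{aligned}
(\mathbf{x}')^T(\mathbf{P}^T\mathbf{A}\mathbf{P})\mathbf{x}' &= (\mathbf{x}')^T\mathbf{D}\mathbf{x}' \\
&= \begin{pmatrix} x' & y' \end{pmatrix} \begin{pmatrix} \lambda_1 & 0 \\ 0 & \lambda_2 \end{pmatrix} \begin{pmatrix} x' \\ y' \end{pmatrix} \\
&= \lambda_1(x')^2 + \lambda_2(y')^2,
\end{aligned}
\]

and therefore there is no $x'y'$-term in the new equation (9) for the conic.
Written in the form of equation (9), this now more closely resembles the
equation of a conic in standard position. The vectors in the orthonormal
basis $E$ of the plane are aligned with the axes of the conic: we say we have
aligned the axes.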


The order and direction in which the eigenvectors are chosen affects the
orthonormal basis $E$ and therefore the transition matrix $\mathbf{P}$ obtained.

However, in every case $\mathbf{P}$ is an orthogonal matrix and so $\det \mathbf{P} = \pm 1$.
Orthogonal diagonalisation ensures that the new basis vectors are
orthogonal (perpendicular) and of magnitude 1. If $\mathbf{P}$ is considered to
represent a linear transformation (as opposed to a transition matrix), then
the linear transformation is either a rotation ($\det \mathbf{P} = +1$) or a reflection
($\det \mathbf{P} = -1$).

It is sometimes preferable, when choosing the orthonormal basis $E$, for it
to be a rotation (rather than a reflection) of the standard basis vectors;
that is, that $\mathbf{P}$, considered as a linear transformation, is a rotation. This is
achieved by ensuring that $\det \mathbf{P} = +1$ (using either geometric insight, or
by checking the determinant). However, this step is not required in this
module.


We now illustrate the process of rewriting a conic in the form of
equation (9) by applying the process to equation (7), where

\[ \mathbf{A} = \begin{pmatrix} 1 & -2 \\ -2 & -2 \end{pmatrix}. \]


Worked Exercise C73 


Express the non-degenerate conic 


\[ x^2 - 4xy - 2y^2 + 6x + 12y + 21 = 0 \]


in the form of equation (9). 


Solution 


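®. The matrix form of the equation of the conic is
$\mathbf{x}^T\mathbf{A}\mathbf{x} + \mathbf{J}^T\mathbf{x} + H = 0$, where

\[ \mathbf{A} = \begin{pmatrix} 1 & -2 \\ -2 & -2 \end{pmatrix}, \quad \mathbf{J} = \begin{pmatrix} 6 \\ 12 \end{pmatrix} \quad\text{and}\quad H = 21. \]

In Exercise C119(b) you found that the eigenvectors of $\mathbf{A}$ are the
non-zero vectors $(k, 2k)$ and $(-2k, k)$, corresponding to the
eigenvalues $\lambda = -3$ and $\lambda = 2$, respectively.

We start by orthogonally diagonalising $\mathbf{A}$. .®

We use Strategy C22 to orthogonally diagonalise $\mathbf{A}$.

An orthonormal basis for $S(-3)$ is

\[ \left\{ \left(\tfrac{1}{\sqrt{5}}, \tfrac{2}{\sqrt{5}}\right) \right\} \]

and an orthonormal basis for $S(2)$ is

\[ \left\{ \left(-\tfrac{2}{\sqrt{5}}, \tfrac{1}{\sqrt{5}}\right) \right\}. \]

By Theorem C64 an orthonormal eigenvector basis of $\mathbf{A}$ is therefore

\[ E = \left\{ \left(\tfrac{1}{\sqrt{5}}, \tfrac{2}{\sqrt{5}}\right), \left(-\tfrac{2}{\sqrt{5}}, \tfrac{1}{\sqrt{5}}\right) \right\}. \]

We use the eigenvectors in $E$ to form the columns of the transition
matrix:

\[ \mathbf{P} = \begin{pmatrix} \tfrac{1}{\sqrt{5}} & -\tfrac{2}{\sqrt{5}} \\ \tfrac{2}{\sqrt{5}} & \tfrac{1}{\sqrt{5}} \end{pmatrix}. \]

®. Note that $\det \mathbf{P} = +1$, so the basis vectors in $E$ are the images of
the standard basis vectors under a rotation, but that does not concern
us here. .®

We use the eigenvalues to form the diagonal matrix

\[ \mathbf{P}^T\mathbf{A}\mathbf{P} = \mathbf{D} = \begin{pmatrix} -3 & 0 \\ 0 & 2 \end{pmatrix}. \]

®. We substitute into $(\mathbf{x}')^T(\mathbf{P}^T\mathbf{A}\mathbf{P})\mathbf{x}' + \mathbf{J}^T\mathbf{P}\mathbf{x}' + H = 0$. .®

It follows from equation (9) that the equation of the conic is now

\[ \begin{pmatrix} x' & y' \end{pmatrix} \begin{pmatrix} -3 & 0 \\ 0 & 2 \end{pmatrix} \begin{pmatrix} x' \\ y' \end{pmatrix} + \begin{pmatrix} 6 & 12 \end{pmatrix} \begin{pmatrix} \tfrac{1}{\sqrt{5}} & -\tfrac{2}{\sqrt{5}} \\ \tfrac{2}{\sqrt{5}} & \tfrac{1}{\sqrt{5}} \end{pmatrix} \begin{pmatrix} x' \\ y' \end{pmatrix} + 21 = 0, \]

that is,

\[ -3(x')^2 + 2(y')^2 + 6\sqrt{5}\,x' + 21 = 0. \]

®. There are no terms in $x'y'$ in this new equation. .®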


You might wonder what the equation in Worked Exercise C73 would have 
been if the eigenvalues had been chosen in the opposite order? The next 
exercise investigates this. 


Exercise C142 


Express the non-degenerate conic 


\[ x^2 - 4xy - 2y^2 + 6x + 12y + 21 = 0 \]

in the form of equation (9), using the eigenvalues in the order $\lambda = 2$ then
$\lambda = -3$.




The equation of the conic with the eigenvalues $\lambda = -3$ then $\lambda = 2$ and the
equation of the conic with the eigenvalues $\lambda = 2$ then $\lambda = -3$ are very
similar. It looks like the roles of $x'$ and $y'$ have been interchanged; that is,
the order of the coordinates has been interchanged, which corresponds to
interchanging the axes. We have $\det \mathbf{P} = -1$ in Exercise C142 so this
transition matrix corresponds to a reflection of the axes, whereas we have
$\det \mathbf{P} = 1$ in Worked Exercise C73 so this transition matrix corresponds to
a rotation.

In general, for any conic, if

\[ \mathbf{P}^T\mathbf{A}\mathbf{P} = \begin{pmatrix} \lambda_1 & 0 \\ 0 & \lambda_2 \end{pmatrix}, \]

then equation (9) is of the form

\[ \lambda_1(x')^2 + \lambda_2(y')^2 + fx' + gy' + H = 0, \tag{10} \]

where $\begin{pmatrix} f & g \end{pmatrix} = \mathbf{J}^T\mathbf{P}$.

The equation of the conic in this form has been simplified since it now has
no $x'y'$ terms, but is not yet in a form from which we can easily recognise
the type of the conic: a translation of the axes is also required.
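The coefficients of equation (10) can be computed numerically once a transition matrix has been chosen. The sketch below is illustrative only: it assumes the matrices of equation (7) and the transition matrix whose columns are $(1/\sqrt{5}, 2/\sqrt{5})$ and $(-2/\sqrt{5}, 1/\sqrt{5})$, and the helper name `align_axes` is hypothetical. Floating-point arithmetic is used, so the results agree with the exact values only up to rounding.

```python
import math

def transpose(m):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]

def matmul(x, y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*y)]
            for row in x]

def align_axes(A, J, P):
    """Return (D, [f, g]) where D = P^T A P and (f g) = J^T P."""
    D = matmul(matmul(transpose(P), A), P)
    fg = matmul([J], P)[0]            # [J] is the row vector J^T
    return D, fg

r5 = math.sqrt(5)
A = [[1, -2], [-2, -2]]
J = [6, 12]
P = [[1 / r5, -2 / r5],
     [2 / r5, 1 / r5]]

D, (f, g) = align_axes(A, J, P)
print(D)       # approximately [[-3, 0], [0, 2]]
print(f, g)    # approximately 6*sqrt(5) and 0
```

Swapping the two columns of `P` swaps the diagonal entries of `D` and the values of `f` and `g`, which is the interchange of axes described above.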


Translating the origin 


To write the equation of the conic in standard form from which we can
easily recognise the type of the conic, we need to eliminate any superfluous
linear $x'$ and $y'$ terms. This is achieved by translating the origin using an
$(\alpha, \beta)$-translation and moving to new coordinates $\mathbf{x}'' = (x'', y'')$: we say we
have translated the origin.

To do this, we first complete the squares in the equation of the conic. We
illustrate this process using the conic with equation (7). We have already
aligned the axes to obtain the equation

\[ -3(x')^2 + 2(y')^2 + 6\sqrt{5}\,x' + 21 = 0, \]

which is equivalent to

\[ -3\left((x')^2 - 2\sqrt{5}\,x'\right) + 2(y')^2 + 21 = 0. \]

This equation has no linear $y'$ term, so we only need to complete the
square involving $x'$. We obtain

\[ -3(x' - \sqrt{5})^2 + 15 + 2(y')^2 + 21 = 0, \]

so

\[ -3(x' - \sqrt{5})^2 + 2(y')^2 + 36 = 0. \]

In Subsection 1.3 of Unit A4 you saw that applying an $(\alpha, \beta)$-translation to
the graph of $y = f(x)$ gives the graph of $y = f(x - \alpha) + \beta$, or equivalently,
$y - \beta = f(x - \alpha)$. We can express this translated curve more simply by
using new $(x', y')$-coordinates obtained by an $(\alpha, \beta)$-translation of the
$(x, y)$-axes: we do this by setting $x' = x - \alpha$ and $y' = y - \beta$. In this new
$(x', y')$-coordinate system the equation of the translated curve is $y' = f(x')$.




For our conic we use a $(\sqrt{5}, 0)$-translation, so we set the new coordinates
to be

\[ \mathbf{x}'' = (x'', y'') = (x' - \sqrt{5}, y'). \]

Thus we rewrite the equation of the conic using these coordinates by
substituting

\[ x'' = x' - \sqrt{5} \quad\text{and}\quad y'' = y', \]

which results in the following simplified equation of the conic

\[ -3(x'')^2 + 2(y'')^2 = -36, \]

or

\[ \frac{(x'')^2}{12} - \frac{(y'')^2}{18} = 1. \]

This equation is now recognisable as the equation of a hyperbola in
standard form. In fact, it is also a hyperbola in standard position with
respect to these new axes, since the $(x'')^2$ term is positive and the $(y'')^2$
term is negative.

For this conic we have

• introduced matrices $\mathbf{A}$ and $\mathbf{J}$

• orthogonally diagonalised the matrix $\mathbf{A}$ to find the orthogonal transition
matrix $\mathbf{P}$ which rotates the $(x, y)$-axes by $\theta = \cos^{-1}(1/\sqrt{5})$ to get the
$(x', y')$-axes

• translated by $\sqrt{5}$ in the $x'$ direction to get the $(x'', y'')$-axes.

This is illustrated in Figure 18.
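One way to gain confidence in the two changes of coordinates is to take points on the hyperbola $(x'')^2/12 - (y'')^2/18 = 1$, map them back to $(x, y)$-coordinates, and substitute into equation (7). The following is a minimal numerical sketch under stated assumptions: the parametrisation by $\cosh$ and $\sinh$ and the function names are introduced here for illustration, and the residuals are zero only up to floating-point rounding.

```python
import math

SQRT5 = math.sqrt(5)

# Transition matrix with columns (1/sqrt5, 2/sqrt5) and (-2/sqrt5, 1/sqrt5).
P = [[1 / SQRT5, -2 / SQRT5],
     [2 / SQRT5, 1 / SQRT5]]

def original_equation(x, y):
    """Left-hand side of x^2 - 4xy - 2y^2 + 6x + 12y + 21 = 0."""
    return x * x - 4 * x * y - 2 * y * y + 6 * x + 12 * y + 21

def to_standard_coordinates(x2, y2):
    """Map (x'', y'') to (x, y): undo the translation, then apply P."""
    x1, y1 = x2 + SQRT5, y2           # x' = x'' + sqrt5, y' = y''
    return (P[0][0] * x1 + P[0][1] * y1,
            P[1][0] * x1 + P[1][1] * y1)

def hyperbola_point(t):
    """A point on one branch of (x'')^2/12 - (y'')^2/18 = 1."""
    return math.sqrt(12) * math.cosh(t), math.sqrt(18) * math.sinh(t)

residuals = [original_equation(*to_standard_coordinates(*hyperbola_point(t)))
             for t in (-1.5, -0.5, 0.0, 0.5, 1.5)]
centre = to_standard_coordinates(0.0, 0.0)

print(max(abs(r) for r in residuals))  # close to 0
print(centre)                          # close to (1, 2), the centre of the hyperbola
```

The origin of the $(x'', y'')$-axes maps to the point $(1, 2)$, which agrees with the centre of the hyperbola stated earlier in this subsection.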


Figure 18 Moving the axes to get the equation of the conic in standard form
($\lambda = -3$ then $\lambda = 2$): align the axes, then translate the origin


What would the equation of this conic have been if the eigenvalues had 
been chosen in the opposite order? The next exercise investigates this 
using the equation you found in Exercise C142. 


Exercise C143 


Write the equation of the conic 


\[ x^2 - 4xy - 2y^2 + 6x + 12y + 21 = 0 \]

in standard form by completing the square in the equation

\[ 2(x')^2 - 3(y')^2 + 6\sqrt{5}\,y' + 21 = 0 \]

and then making a substitution to get coordinates $(x'', y'')$.


Figure 19 illustrates how the axes have been moved with the eigenvalues in
the order $\lambda = 2$ then $\lambda = -3$, as in Exercise C143: the axes are reflected
and then translated.

Figure 19 Moving the axes to get the equation of the conic in standard form
($\lambda = 2$ then $\lambda = -3$): align the axes, then translate the origin


The equations in standard form found for the conic with equation (7) are

\[ \frac{(x'')^2}{12} - \frac{(y'')^2}{18} = 1, \quad\text{for } \lambda = -3 \text{ then } \lambda = 2, \]

and

\[ -\frac{(x'')^2}{18} + \frac{(y'')^2}{12} = 1, \quad\text{for } \lambda = 2 \text{ then } \lambda = -3. \]

In the second case the hyperbola is not in standard position with respect
to these new axes, since the $(x'')^2$ term is negative and the $(y'')^2$ term is
positive.


It is clear that the roles of x” and y” have been interchanged. 
Geometrically, the new axes of the plane have been interchanged, so the 
hyperbola has related, but different, equations in relation to these different 
choices of axes. However, both equations are in the standard form for a 
hyperbola, so the choice of the order of the eigenvalues does not affect the 
conclusion that this conic is a hyperbola. 




Ellipse and hyperbola 


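In general, if neither eigenvalue is 0, then completing the squares in
equation (10) gives an equation of the form

\[ \lambda_1\left(x' + \frac{f}{2\lambda_1}\right)^2 - \lambda_1\left(\frac{f}{2\lambda_1}\right)^2 + \lambda_2\left(y' + \frac{g}{2\lambda_2}\right)^2 - \lambda_2\left(\frac{g}{2\lambda_2}\right)^2 + H = 0, \]

which can be written as

\[ \lambda_1(x'')^2 + \lambda_2(y'')^2 = K, \]

where

\[ x'' = x' + \frac{f}{2\lambda_1}, \quad y'' = y' + \frac{g}{2\lambda_2} \quad\text{and}\quad K = \frac{f^2}{4\lambda_1} + \frac{g^2}{4\lambda_2} - H. \]

Writing the equation in standard form gives

\[ \frac{(x'')^2}{K/\lambda_1} + \frac{(y'')^2}{K/\lambda_2} = 1, \]

which is the equation of an ellipse if both $K/\lambda_1$ and $K/\lambda_2$ are positive, and
a hyperbola if one is negative and the other positive. (No other possibility
can occur, although we do not explicitly show this.)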


Parabola 


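In general, if one eigenvalue is 0, say $\lambda_1$ is 0 and $\lambda_2 \neq 0$, then equation (10)
has the form

\[ \lambda_2(y')^2 + fx' + gy' + H = 0. \]

Completing the square in this equation gives

\[ fx' + \lambda_2\left(y' + \frac{g}{2\lambda_2}\right)^2 - \lambda_2\left(\frac{g}{2\lambda_2}\right)^2 + H = 0, \]

which can be written as

\[ \lambda_2(y'')^2 + fx'' = 0, \]

where

\[ y'' = y' + \frac{g}{2\lambda_2} \quad\text{and}\quad x'' = x' - \frac{1}{f}\left(\lambda_2\left(\frac{g}{2\lambda_2}\right)^2 - H\right). \]

Writing the equation in standard form gives

\[ (y'')^2 = -\frac{f}{\lambda_2}\,x'', \]

which is the equation of a parabola.

If $\lambda_1 \neq 0$ and $\lambda_2$ is 0, then we obtain the similar equation

\[ (x'')^2 = -\frac{g}{\lambda_1}\,y'', \]

which is also the equation of a parabola.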


Summarising the method 


There are several steps involved in writing the equation of a conic in 
standard form, so we summarise this method in the following strategy. 


Strategy C24 


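To write the non-degenerate conic with equation

\[ Ax^2 + Bxy + Cy^2 + Fx + Gy + H = 0 \]

in standard form, do the following.

1. Introduce matrices:

• write down $\mathbf{A} = \begin{pmatrix} A & \tfrac{1}{2}B \\ \tfrac{1}{2}B & C \end{pmatrix}$ and $\mathbf{J} = \begin{pmatrix} F \\ G \end{pmatrix}$.

2. Align the axes:

• orthogonally diagonalise $\mathbf{A}$ to get

\[ \mathbf{P}^T\mathbf{A}\mathbf{P} = \begin{pmatrix} \lambda_1 & 0 \\ 0 & \lambda_2 \end{pmatrix} \]

• find $\begin{pmatrix} f & g \end{pmatrix} = \mathbf{J}^T\mathbf{P}$, and write the conic in the form
$\lambda_1(x')^2 + \lambda_2(y')^2 + fx' + gy' + H = 0$.

3. Translate the origin:

• complete the squares
• make a substitution to change to the coordinate system $(x'', y'')$.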

The order in which the eigenvalues are chosen does not affect the form of 
the equation obtained: it will be the standard form for an ellipse, a 
hyperbola or a parabola. 
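The three steps of this strategy can be sketched numerically. The code below is a rough illustration rather than a definitive implementation: the function name `classify_conic`, its return format and the tolerance `tol` are assumptions made here, the eigenvalues are found from the trace and determinant of the $2 \times 2$ matrix instead of by the methods of this unit, and the second basis vector is obtained by rotating the first through a right angle (so the transition matrix always has determinant $+1$). The conic is assumed to be non-degenerate; an error is raised if that assumption visibly fails.

```python
import math

def classify_conic(A, B, C, F, G, H, tol=1e-9):
    """Classify Ax^2 + Bxy + Cy^2 + Fx + Gy + H = 0 (assumed non-degenerate).

    Returns (kind, (lam1, lam2), denominators), where denominators are
    (K/lam1, K/lam2) for an ellipse or hyperbola and None for a parabola.
    """
    # Step 1: the symmetric matrix is [[A, b], [b, C]] with b = B/2.
    b = B / 2
    # Step 2: eigenvalues from the trace and determinant of the matrix.
    mean = (A + C) / 2
    radius = math.hypot((A - C) / 2, b)
    lam1, lam2 = mean + radius, mean - radius
    # A unit eigenvector for lam1; (b, lam1 - A) solves (A - lam1)x + by = 0.
    if abs(b) > tol:
        ex, ey = b, lam1 - A
    elif A >= C:
        ex, ey = 1.0, 0.0
    else:
        ex, ey = 0.0, 1.0
    norm = math.hypot(ex, ey)
    e1 = (ex / norm, ey / norm)
    e2 = (-e1[1], e1[0])              # perpendicular unit vector, eigenvalue lam2
    f = F * e1[0] + G * e1[1]         # (f g) = J^T P
    g = F * e2[0] + G * e2[1]
    # Step 3: decide the type from the completed-square form.
    if abs(lam1) <= tol or abs(lam2) <= tol:
        linear = f if abs(lam1) <= tol else g
        if abs(linear) <= tol:
            raise ValueError("degenerate conic")
        return "parabola", (lam1, lam2), None
    K = f * f / (4 * lam1) + g * g / (4 * lam2) - H
    d1, d2 = K / lam1, K / lam2
    if abs(K) <= tol or (d1 < 0 and d2 < 0):
        raise ValueError("degenerate conic or no real points")
    kind = "ellipse" if d1 > 0 and d2 > 0 else "hyperbola"
    return kind, (lam1, lam2), (d1, d2)

kind_a, lams_a, denoms_a = classify_conic(5, 4, 5, 20, 8, -1)
kind_b, lams_b, denoms_b = classify_conic(1, -4, -2, 6, 12, 21)
kind_c, lams_c, denoms_c = classify_conic(1, -4, 4, -6, -8, 5)
print(kind_a, denoms_a)
print(kind_b, denoms_b)
print(kind_c, lams_c)
```

Because this sketch always lists the larger eigenvalue first, the denominators it reports may appear in the opposite order to a hand calculation that chooses the eigenvalues differently; as noted above, this does not change the type of conic obtained.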


The following worked exercise and exercises illustrate this strategy. 


Worked Exercise C74 


Use Strategy C24 to write the non-degenerate conic with equation 


\[ 5x^2 + 4xy + 5y^2 + 20x + 8y - 1 = 0 \]


in standard form. Is this conic an ellipse, a parabola or a hyperbola? 


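Solution

1. Introduce matrices.

We have

\[ \mathbf{A} = \begin{pmatrix} 5 & 2 \\ 2 & 5 \end{pmatrix}, \quad \mathbf{J} = \begin{pmatrix} 20 \\ 8 \end{pmatrix} \quad\text{and}\quad H = -1. \]

2. Align the axes.

®. We orthogonally diagonalised $\mathbf{A}$ in Worked Exercise C71. .®

We have

\[ \mathbf{P}^T\mathbf{A}\mathbf{P} = \begin{pmatrix} 7 & 0 \\ 0 & 3 \end{pmatrix}, \]

where

\[ \mathbf{P} = \begin{pmatrix} \tfrac{1}{\sqrt{2}} & \tfrac{1}{\sqrt{2}} \\ \tfrac{1}{\sqrt{2}} & -\tfrac{1}{\sqrt{2}} \end{pmatrix}, \]

so

\[ \begin{pmatrix} f & g \end{pmatrix} = \begin{pmatrix} 20 & 8 \end{pmatrix} \begin{pmatrix} \tfrac{1}{\sqrt{2}} & \tfrac{1}{\sqrt{2}} \\ \tfrac{1}{\sqrt{2}} & -\tfrac{1}{\sqrt{2}} \end{pmatrix} = \begin{pmatrix} \tfrac{28}{\sqrt{2}} & \tfrac{12}{\sqrt{2}} \end{pmatrix} = \begin{pmatrix} 14\sqrt{2} & 6\sqrt{2} \end{pmatrix}. \]

The equation of the conic is now

\[ 7(x')^2 + 3(y')^2 + 14\sqrt{2}\,x' + 6\sqrt{2}\,y' - 1 = 0. \]

3. Translate the origin.

®. To keep track of the terms when completing the square, we first
collect the $x'$ terms and the $y'$ terms. We take out the coefficients
of $(x')^2$ and $(y')^2$ as factors. .®

We write this equation as

\[ 7\left((x')^2 + 2\sqrt{2}\,x'\right) + 3\left((y')^2 + 2\sqrt{2}\,y'\right) - 1 = 0. \]

Completing the squares in this equation, we obtain

\[ 7(x' + \sqrt{2})^2 - 14 + 3(y' + \sqrt{2})^2 - 6 - 1 = 0. \]

We substitute $x'' = x' + \sqrt{2}$ and $y'' = y' + \sqrt{2}$ into this equation
and simplify to obtain

\[ 7(x'')^2 + 3(y'')^2 = 21. \]

The equation of the conic in standard form is

\[ \frac{(x'')^2}{3} + \frac{(y'')^2}{7} = 1. \]

The conic is an ellipse.

®. We can see that this ellipse is not in standard position with
respect to these new axes since $3 < 7$. .®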




Exercise C144 


Use Strategy C24 to write the non-degenerate conic with equation 


\[ 9x^2 - 4xy + 6y^2 - 10x - 20y - 5 = 0 \]


in standard form. Is the conic an ellipse, a parabola or a hyperbola? 


(In Exercise C137(a) you found that 


e(a) h) 


is an orthonormal eigenvector basis for the matrix $\mathbf{A}$ of this conic with
respect to the eigenvalues $\lambda = 10$ and $\lambda = 5$.)


Exercise C145 


Use Strategy C24 to write the non-degenerate conic with equation 


\[ x^2 - 4xy + 4y^2 - 6x - 8y + 5 = 0 \]


in standard form. Is the conic an ellipse, a parabola or a hyperbola? 


4.2 Classifying quadrics 


Quadrics, or quadric surfaces, are surfaces in R³. They are the
three-dimensional analogues of conics. 


Definition 
A quadric in R³ is the set of points (x, y, z) that satisfy an equation
of the form

\[ Ax^2 + By^2 + Cz^2 + Fxy + Gyz + Hxz + Jx + Ky + Lz + M = 0, \]

where A to M are real numbers, and A, B, C, F, G and H are not
all 0.


In general the situation is more complicated than for conics, and a full
treatment is beyond the scope of this module. However, it can be shown
that there are nine types of quadrics involving curved surfaces in R³. Each
of these types can be positioned in space to be in standard position; 
that is, with its axes aligned with the x-, y- and z-axes in a similar manner 
to the non-degenerate conics. These quadrics in standard position have 
easily recognisable equations and the different types can be distinguished 
by the curves of intersection of the planes parallel to the coordinate 
planes that meet the quadric in a non-trivial intersection. Figure 20 shows 
some curves of intersection for a sphere — they are all circles. 




Figure 20 Some curves of 
intersection of a sphere 




The curves of intersection of a non-degenerate quadric are 
non-degenerate conics. There are five types of non-degenerate quadric: 


• the ellipsoid (which includes the sphere)
• the elliptic paraboloid
• the hyperbolic paraboloid
• the hyperboloid of one sheet
• the hyperboloid of two sheets.


Table 1 illustrates each of these quadrics and gives the equation in 
standard position, as well as specifying the curves of intersection. 


There are four types of degenerate quadric involving curved surfaces: 
• the elliptic cone
• the elliptic cylinder
• the parabolic cylinder
• the hyperbolic cylinder.


The curves of intersection of these include non-degenerate conics, 
degenerate conics and pairs of parallel lines. The elliptic cone in standard 
position is illustrated in Table 1, where the equation is given and the 
curves of intersection specified. The elliptic cone can be considered as 
intermediate between the hyperboloids of one and two sheets — where the 
two sheets touch at a point. The three types of cylinder in standard 
position, illustrated in Figure 21, are surfaces whose equations do not 
involve z explicitly. 


Figure 21 Degenerate quadrics: (a) elliptic cylinder (b) parabolic cylinder 
and (c) hyperbolic cylinder 


The only degenerate quadrics we will consider for the remainder of the 
linear algebra topic are elliptic cones, thus giving the following list of six 
quadrics, all included in Table 1: the ellipsoid (including the sphere), the 
elliptic paraboloid, the hyperbolic paraboloid, the hyperboloid of one 
sheet, the hyperboloid of two sheets and the elliptic cone. 


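Table 1 Quadrics: equation in standard position and the curves of intersection


Ellipsoid
\[ \frac{x^2}{a^2} + \frac{y^2}{b^2} + \frac{z^2}{c^2} = 1 \]
curves of intersection: ellipse


Elliptic paraboloid
\[ \frac{x^2}{a^2} + \frac{y^2}{b^2} = z \]
curves of intersection: ellipse or parabola


Hyperbolic paraboloid
\[ \frac{x^2}{a^2} - \frac{y^2}{b^2} = z \]
curves of intersection: hyperbola or parabola


Hyperboloid of one sheet
\[ \frac{x^2}{a^2} + \frac{y^2}{b^2} - \frac{z^2}{c^2} = 1 \]
curves of intersection: ellipse or hyperbola


Hyperboloid of two sheets
\[ \frac{x^2}{a^2} + \frac{y^2}{b^2} - \frac{z^2}{c^2} = -1 \]
curves of intersection: ellipse or hyperbola


Elliptic cone
\[ \frac{x^2}{a^2} + \frac{y^2}{b^2} - \frac{z^2}{c^2} = 0 \]
curves of intersection: ellipse or hyperbola (or a degenerate conic)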




Gaspard Monge 


Jean Nicolas Pierre Hachette 




The first systematic classification of quadric surfaces was by Leonhard 
Euler (1707-1783) in his celebrated Introductio in analysin 
infinitorum (1748) — the textbook in which he laid down the 
foundations of analysis — where he treated surfaces of second degree as 
a family of quadrics in space analogous to the plane conic sections. 
The subject was developed in a more rigorous way by Gaspard Monge 
(1746-1818) and Jean Nicolas Pierre Hachette (1769-1834) who, in 
1802, provided an algebraic study of quadric surfaces, which was later 
published as a textbook. Both Monge and Hachette were professors at 
the famous École Polytechnique in Paris. This college was founded at
the end of the eighteenth century to provide students with a
mathematical and scientific education, and to prepare them for entry
to the prestigious Grandes Écoles, higher education establishments for
the training of civil and military engineers.


As with conics, to identify a given quadric from its equation, we will align 
the axes and translate the origin to obtain an equation that resembles the 
equation of a quadric in standard position: we say that such an equation of 
a quadric is in standard form. So, for example, the equation 


r2 y? z2 


a P'e 
is an equation of a hyperboloid of one sheet in standard form, although it 
is not in standard position. 


=l 


To write the equation of a quadric in standard form, we use the same 
techniques that we used for conics: introducing matrices, orthogonal 
diagonalisation and completing the square. We omit the justification — it is 
analogous to that for conics. 


We summarise this method in the following strategy. 


Strategy C25 
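To write the quadric with equation
\[ Ax^2 + By^2 + Cz^2 + Fxy + Gyz + Hxz + Jx + Ky + Lz + M = 0 \]
in standard form, do the following.
1. Introduce matrices:

• write down the matrices

\[ \mathbf{A} = \begin{pmatrix} A & \tfrac12 F & \tfrac12 H \\ \tfrac12 F & B & \tfrac12 G \\ \tfrac12 H & \tfrac12 G & C \end{pmatrix} \quad\text{and}\quad \mathbf{J} = \begin{pmatrix} J \\ K \\ L \end{pmatrix}. \]

2. Align the axes:

• orthogonally diagonalise A to get

\[ P^T A P = \begin{pmatrix} \lambda_1 & 0 & 0 \\ 0 & \lambda_2 & 0 \\ 0 & 0 & \lambda_3 \end{pmatrix} \]

• find $(f \;\; g \;\; h) = J^T P$, and write the quadric in the form
\[ \lambda_1 (x')^2 + \lambda_2 (y')^2 + \lambda_3 (z')^2 + f x' + g y' + h z' + M = 0. \]

3. Translate the origin:

• complete the squares
• make a substitution to change to the coordinate system $(x'', y'', z'')$.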


The following worked exercise and exercises illustrate this strategy. 


Worked Exercise C75 


Use Strategy C25 to write the quadric with equation 
\[ 5x^2 + 3y^2 + 3z^2 - 2xy + 2yz - 2xz - 10x + 6y - 2z - 9 = 0 \]


in standard form. Which of the six types of quadric does this represent? 


Solution 


(As with conics, since some parts of this working can be quite long,
we number the strategy steps in the solution.)


1. Introduce matrices. 


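We have
\[ A = \begin{pmatrix} 5 & -1 & -1 \\ -1 & 3 & 1 \\ -1 & 1 & 3 \end{pmatrix} \quad\text{and}\quad J = \begin{pmatrix} -10 \\ 6 \\ -2 \end{pmatrix}. \]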


2. Align the axes. 
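(You orthogonally diagonalised A in Exercise C137(b).)
We have

\[ P^T A P = \begin{pmatrix} 6 & 0 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 2 \end{pmatrix}, \]

where

\[ P = \begin{pmatrix} -\frac{2}{\sqrt6} & \frac{1}{\sqrt3} & 0 \\ \frac{1}{\sqrt6} & \frac{1}{\sqrt3} & \frac{1}{\sqrt2} \\ \frac{1}{\sqrt6} & \frac{1}{\sqrt3} & -\frac{1}{\sqrt2} \end{pmatrix}. \]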




(Since det P = 1, this transition matrix represents a rotation of
the basis vectors, but this fact does not concern us here.)


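So
\[ (f \;\; g \;\; h) = J^T P = (4\sqrt6 \;\; -2\sqrt3 \;\; 4\sqrt2). \]
The equation of the quadric is now
\[ 6(x')^2 + 3(y')^2 + 2(z')^2 + 4\sqrt6\,x' - 2\sqrt3\,y' + 4\sqrt2\,z' - 9 = 0. \]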
3. Translate the origin. 


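We write this equation as
\[ 6\left((x')^2 + \tfrac{4}{\sqrt6}\,x'\right) + 3\left((y')^2 - \tfrac{2}{\sqrt3}\,y'\right) + 2\left((z')^2 + 2\sqrt2\,z'\right) - 9 = 0. \]

Completing the squares in this equation, we obtain
\[ 6\left(x' + \tfrac{2}{\sqrt6}\right)^2 - 4 + 3\left(y' - \tfrac{1}{\sqrt3}\right)^2 - 1 + 2\left(z' + \sqrt2\right)^2 - 4 - 9 = 0. \]
Substituting
\[ x'' = x' + \tfrac{2}{\sqrt6}, \quad y'' = y' - \tfrac{1}{\sqrt3} \quad\text{and}\quad z'' = z' + \sqrt2 \]
in this equation and simplifying, we obtain
\[ 6(x'')^2 + 3(y'')^2 + 2(z'')^2 = 18. \]
The equation of the quadric in standard form is
\[ \frac{(x'')^2}{3} + \frac{(y'')^2}{6} + \frac{(z'')^2}{9} = 1. \]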


This is the equation of an ellipsoid. 
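
The 'align the axes' step of this worked exercise can be checked with a few lines of plain Python. This is only a sketch: the helpers `matmul` and `transpose` are hypothetical names introduced here, and the columns of P are taken to be the unit eigenvectors (−2, 1, 1)/√6, (1, 1, 1)/√3 and (0, 1, −1)/√2, an assumption that agrees with the eigenvectors found in the solution to Exercise C137(b).

```python
import math


def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def transpose(X):
    return [list(row) for row in zip(*X)]


A = [[5, -1, -1], [-1, 3, 1], [-1, 1, 3]]
J = [[-10], [6], [-2]]

s6, s3, s2 = math.sqrt(6), math.sqrt(3), math.sqrt(2)
P = [[-2 / s6, 1 / s3, 0.0],
     [1 / s6, 1 / s3, 1 / s2],
     [1 / s6, 1 / s3, -1 / s2]]

D = matmul(matmul(transpose(P), A), P)   # should be diag(6, 3, 2)
fgh = matmul(transpose(J), P)[0]         # should be (4*sqrt6, -2*sqrt3, 4*sqrt2)

for row in D:
    print([round(entry, 10) for entry in row])
print(fgh)
```

Up to rounding, D is the diagonal matrix with entries 6, 3, 2 and the row vector agrees with the linear coefficients used in step 3.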


Exercise C146 


Use Strategy C25 to write the quadric with equation 
\[ x^2 + y^2 + z^2 - 2x + 4y - 6z - 11 = 0 \]
in standard form. Which of the six types of quadric does this represent? 




Exercise C147 


Use Strategy C25 to write the quadric with equation 
\[ 4x^2 + 3y^2 + 2z^2 + 4xy + 4yz + 12x + 12z + 18 = 0 \]
in standard form. Which of the six types of quadric does this represent? 


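(At the start of Subsection 3.1 we found that
\[ E = \left\{ \left( \tfrac23, \tfrac23, \tfrac13 \right), \left( -\tfrac23, \tfrac13, \tfrac23 \right), \left( \tfrac13, -\tfrac23, \tfrac23 \right) \right\} \]
is an orthonormal eigenvector basis for the matrix A of this quadric with
respect to the eigenvalues $\lambda = 6$, $\lambda = 3$ and $\lambda = 0$.)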


Summary 


In this unit you have met eigenvectors and eigenvalues: an eigenvector of a
linear transformation t : V → V is a non-zero vector v that is mapped by t
to a scalar multiple of itself, and this scalar is the corresponding
eigenvalue λ. Since such a linear transformation always has a square
matrix representation, you have seen that eigenvectors and eigenvalues can
equivalently be defined in terms of matrices: Av = λv. You have found
eigenvalues and eigenvectors by solving the corresponding characteristic
equation det(A − λI) = 0. You have seen that there may be no
eigenvalues, for example when t is a rotation of R² through π/2, and that
all the eigenvectors corresponding to a given eigenvalue λ, plus the zero
vector, form a subspace S(λ) of V whose dimension is never greater than the
multiplicity of the eigenvalue.


You have investigated when t has an eigenvector basis E; that is, a basis
comprising only eigenvectors of t, and you have met transition matrices P
from a basis E of V to the standard basis. You have seen
(Theorem C60) that the transition matrix P maps E-coordinates
of V to standard coordinates of V and that P is invertible. You have learned
(Theorem C62) that whenever an eigenvector basis can be found, the
transition matrix P can be used to express the matrix A of t (with respect
to the standard basis) as a diagonal matrix (with respect to this
eigenvector basis) via the relation D = P⁻¹AP. Furthermore, when t has
a symmetric matrix representation, the eigenvectors corresponding to
different eigenvalues are orthogonal (Theorem C64), and an eigenvector
basis can always be found. In addition, the basis vectors can be chosen to
give an orthonormal eigenvector basis so that the transition matrix is an
orthogonal matrix satisfying Pᵀ = P⁻¹, giving D = PᵀAP.




Thus diagonalising matrices involves the main ideas you have studied 
throughout this book on linear algebra: vectors, matrices, vector spaces, 
bases and linear transformations. 


In the final section you have seen how these techniques can be used to 
identify the type of a conic, or quadric, from its equation. 


Learning outcomes 


After working through this unit, you should be able to: 


• explain the meaning of the terms eigenvalue, eigenvector, characteristic
equation and eigenspace
• recognise the geometric interpretation of eigenvectors and eigenspaces in
special cases
• find the eigenvalues and eigenvectors of a given 2 × 2 or 3 × 3 matrix
• describe some basic properties of eigenvalues and eigenvectors
• write down the matrix of a linear transformation t with respect to a
given eigenvector basis of t
• write down the transition matrix from an eigenvector basis to the
standard basis
• diagonalise a given square matrix, if possible
• understand that any symmetric matrix can be orthogonally diagonalised
• orthogonally diagonalise a given symmetric matrix
• describe some basic properties of orthogonal matrices
• write the equation of a given non-degenerate conic in standard form and
hence classify it
• understand the term quadric and recognise the six types of quadric
covered
• write the equation of a given quadric in standard form and hence
classify it.


Solutions to exercises 


Solution to Exercise C115 


We have 
t(2,—2) = (2 — 8,2 +4) = (—6, 6) 
= -3(2,—2) 
and 
t(—7,7) = (—7 + 28, —7 — 14) = (21, —21) 


= -3(-7,7). 


In each case the original vector is scaled by the 
factor —3. 


Solution to Exercise C116 

(a) We have t(0,1) = (4, —2), t(1,2) = (9, —3) and 
t(4,1) = (8,2). 

(b) The linear transformation t maps the line 
joining the points (0,0) and (4,1) to the line 
joining the points (0,0) and (8,2). But 

(8,2) = 2(4, 1), so these lines are the same and 
both can be written as x = 4y. Therefore the line 
x = 4y is mapped to itself by the linear
transformation t. 


(c) We have 
t(4k, k) = (4k + 4k, 4k — 2k) = (8k, 2k) 
= 2(4k, k), 


so any vector lying along the line x = 4y is scaled 
by the factor 2. 


Solution to Exercise C117 


(a) A reflection t in the line y = x maps the point 
(x,y) to the point (y, x). Each point on the line 
y = x is mapped to itself, since 


t(k, k) = (k, k) = 1(k, k), 
so the non-zero vectors (k, k) are eigenvectors with 
corresponding eigenvalue 1. 


Each point on the line y = —x is mapped to 
another point on the line y = −x, since


t(k,—k) = (-k,k) = —1 (k, —k), 


so the non-zero vectors (k, —k) are eigenvectors 
with corresponding eigenvalue —1. 




(b) A 2-dilation t maps the point (x,y) to the 
point (2x, 2y). Every line through the origin is 
mapped to itself; that is, every non-zero vector in 
the plane is an eigenvector of t. Let k and l be real 
numbers which are not both zero. Then 


t(k, l) = (2k, 2l) = 2(k, l),


so the non-zero vectors (k,l) are eigenvectors with 
corresponding eigenvalue 2. 


(c) An anticlockwise rotation t through π/2 maps
the point (x, y) to the point (−y, x). No line
through the origin is mapped to itself by t, so t has 
no eigenvectors. 

(d) An anticlockwise rotation t through π maps
the point (x, y) to the point (−x, −y). Each line
through the origin is mapped to itself; that is, each 
non-zero vector in the plane is an eigenvector of t. 


Let k and l be real numbers that are not both zero. 
Then 


t(k, l) = (−k, −l) = −1(k, l),


so the non-zero vectors (k,l) are eigenvectors with 
corresponding eigenvalue —1. 


Solution to Exercise C118 


(a) We wish to find those vectors (x,y) that are 
mapped to scalar multiples of themselves; that is, 
the vectors that satisfy 


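\[ (-5x + 3y,\; 6x - 2y) = (\lambda x, \lambda y). \]
Equating coordinates, we obtain the system
\[ \begin{aligned} -5x + 3y &= \lambda x \\ 6x - 2y &= \lambda y, \end{aligned} \]
which we write as
\[ \begin{aligned} (-5 - \lambda)x + 3y &= 0 \\ 6x + (-2 - \lambda)y &= 0. \end{aligned} \]

(b) Non-zero solutions to the eigenvector
equations exist if and only if the determinant of
the coefficient matrix is 0; that is, if and only if
\[ \begin{vmatrix} -5 - \lambda & 3 \\ 6 & -2 - \lambda \end{vmatrix} = 0. \]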




We expand the determinant and obtain
\[ (-5 - \lambda)(-2 - \lambda) - 18 = 0, \]
which simplifies to
\[ \lambda^2 + 7\lambda - 8 = 0. \]

The eigenvalues of t are the solutions to this
characteristic equation. We have
\[ \lambda^2 + 7\lambda - 8 = (\lambda - 1)(\lambda + 8) = 0, \]
so the eigenvalues are $\lambda = 1$ and $\lambda = -8$.

(c) To find the corresponding eigenvectors, we
consider each value of $\lambda$ in turn.

$\lambda = 1$: The eigenvector equations become
\[ \begin{aligned} -6x + 3y &= 0 \\ 6x - 3y &= 0. \end{aligned} \]
These equations are equivalent to the single
equation
\[ 2x - y = 0. \]
Thus the eigenvectors corresponding to $\lambda = 1$
are the non-zero vectors (x, y) for which
y = 2x; that is, the vectors of the form
(k, 2k), where k ≠ 0.

$\lambda = -8$: The eigenvector equations become
\[ \begin{aligned} 3x + 3y &= 0 \\ 6x + 6y &= 0. \end{aligned} \]
These equations are equivalent to the single
equation
\[ x + y = 0. \]
Thus the eigenvectors corresponding to
$\lambda = -8$ are the non-zero vectors (x, y) for
which y = −x; that is, the vectors of the form
(k, −k), where k ≠ 0.

Thus the eigenvectors of t are the non-zero vectors
of the following forms:

(k, 2k), corresponding to $\lambda = 1$,
(k, −k), corresponding to $\lambda = -8$.
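
For a 2 × 2 matrix the characteristic equation is always the quadratic λ² − (trace)λ + (determinant) = 0, so the eigenvalues can be computed directly from the quadratic formula. The sketch below illustrates this; the function name `eigenvalues_2x2` is a hypothetical helper, and only real eigenvalues are reported.

```python
import math


def eigenvalues_2x2(a, b, c, d):
    """Real eigenvalues of [[a, b], [c, d]].

    Solves lambda^2 - trace*lambda + det = 0 with the quadratic formula
    and returns the distinct real roots in increasing order.
    """
    trace = a + d
    det = a * d - b * c
    disc = trace * trace - 4 * det
    if disc < 0:
        return []                      # no real eigenvalues
    root = math.sqrt(disc)
    return sorted({(trace - root) / 2, (trace + root) / 2})


# Matrix of t(x, y) = (-5x + 3y, 6x - 2y)
print(eigenvalues_2x2(-5, 3, 6, -2))   # [-8.0, 1.0]

# Anticlockwise rotation through pi/2: no real eigenvalues
print(eigenvalues_2x2(0, -1, 1, 0))    # []
```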


Solution to Exercise C119 


(a) The matrix of t with respect to the standard
basis for R² is
\[ A = \begin{pmatrix} 1 & 3 \\ 2 & -4 \end{pmatrix}. \]

We use Strategy C18 to find the eigenvalues and
eigenvectors of A, which are the same as those of t.

First we find the eigenvalues of A.
The characteristic equation of A is
det(A − λI) = 0; that is,
\[ \begin{vmatrix} 1 - \lambda & 3 \\ 2 & -4 - \lambda \end{vmatrix} = 0. \]

We expand the determinant and obtain
\[ (1 - \lambda)(-4 - \lambda) - 6 = 0, \]
which simplifies to
\[ \lambda^2 + 3\lambda - 10 = (\lambda - 2)(\lambda + 5) = 0. \]
The eigenvalues of A are therefore $\lambda = 2$ and
$\lambda = -5$.

Next we find the eigenvectors of A.

The eigenvector equations are
\[ \begin{aligned} (1 - \lambda)x + 3y &= 0 \\ 2x + (-4 - \lambda)y &= 0. \end{aligned} \]

$\lambda = 2$: The eigenvector equations become
\[ \begin{aligned} -x + 3y &= 0 \\ 2x - 6y &= 0. \end{aligned} \]
These equations are equivalent to the single
equation
\[ x - 3y = 0. \]
Thus the eigenvectors corresponding to $\lambda = 2$
are the non-zero vectors for which x = 3y;
that is, the vectors of the form
(3k, k), where k ≠ 0.

$\lambda = -5$: The eigenvector equations become
\[ \begin{aligned} 6x + 3y &= 0 \\ 2x + y &= 0. \end{aligned} \]
These equations are equivalent to the single
equation
\[ 2x + y = 0. \]
Thus the eigenvectors corresponding to
$\lambda = -5$ are the non-zero vectors for which
y = −2x; that is, the vectors of the form
(k, −2k), where k ≠ 0.

Thus the eigenvectors of t are the non-zero vectors
of the following forms:

(3k, k), corresponding to $\lambda = 2$,
(k, −2k), corresponding to $\lambda = -5$.


(b) The matrix of t with respect to the standard
basis for R² is
\[ A = \begin{pmatrix} 1 & -2 \\ -2 & -2 \end{pmatrix}. \]

We use Strategy C18 to find the eigenvalues and
eigenvectors of A, which are the same as those of t.

First we find the eigenvalues of A.
The characteristic equation of A is
det(A − λI) = 0; that is,
\[ \begin{vmatrix} 1 - \lambda & -2 \\ -2 & -2 - \lambda \end{vmatrix} = 0. \]

We expand the determinant and obtain
\[ (1 - \lambda)(-2 - \lambda) - 4 = 0, \]
which simplifies to
\[ \lambda^2 + \lambda - 6 = (\lambda - 2)(\lambda + 3) = 0. \]
The eigenvalues of A are therefore $\lambda = 2$ and
$\lambda = -3$.

Next we find the eigenvectors of A.
The eigenvector equations are
\[ \begin{aligned} (1 - \lambda)x - 2y &= 0 \\ -2x + (-2 - \lambda)y &= 0. \end{aligned} \]

$\lambda = 2$: The eigenvector equations become
\[ \begin{aligned} -x - 2y &= 0 \\ -2x - 4y &= 0. \end{aligned} \]
These equations are equivalent to the single
equation
\[ x + 2y = 0. \]
Thus the eigenvectors corresponding to $\lambda = 2$
are the non-zero vectors for which x = −2y;
that is, the vectors of the form
(−2k, k), where k ≠ 0.

$\lambda = -3$: The eigenvector equations become
\[ \begin{aligned} 4x - 2y &= 0 \\ -2x + y &= 0. \end{aligned} \]
These equations are equivalent to the single
equation
\[ 2x - y = 0. \]
Thus the eigenvectors corresponding to
$\lambda = -3$ are the non-zero vectors for which
y = 2x; that is, the vectors of the form
(k, 2k), where k ≠ 0.

Thus the eigenvectors of t are the non-zero vectors
of the following forms:

(−2k, k), corresponding to $\lambda = 2$,
(k, 2k), corresponding to $\lambda = -3$.


Solution to Exercise C120 


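The matrix of t with respect to the standard basis
for R³ is
\[ A = \begin{pmatrix} 4 & 2 & 0 \\ 2 & 3 & 2 \\ 0 & 2 & 2 \end{pmatrix}. \]
We use Strategy C18 to find the eigenvalues and
eigenvectors of A, which are the same as those of t.

First we find the eigenvalues of A.

The characteristic equation is det(A − λI) = 0;
that is,
\[ \begin{vmatrix} 4 - \lambda & 2 & 0 \\ 2 & 3 - \lambda & 2 \\ 0 & 2 & 2 - \lambda \end{vmatrix} = 0. \]

We expand the determinant and obtain
\[ (4 - \lambda) \begin{vmatrix} 3 - \lambda & 2 \\ 2 & 2 - \lambda \end{vmatrix} - 2 \begin{vmatrix} 2 & 2 \\ 0 & 2 - \lambda \end{vmatrix} + 0 = 0. \]

Simplifying this expression, we obtain
\[ (4 - \lambda)\bigl((3 - \lambda)(2 - \lambda) - 4\bigr) - 2\bigl(2(2 - \lambda)\bigr) = 0, \]
or
\[ \lambda^3 - 9\lambda^2 + 18\lambda = 0. \]

There is no constant term, so we take out the
factor λ, then factorise the remaining quadratic
factor:
\[ \lambda(\lambda^2 - 9\lambda + 18) = \lambda(\lambda - 6)(\lambda - 3) = 0. \]

The eigenvalues of A are therefore $\lambda = 0$, $\lambda = 6$
and $\lambda = 3$.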




(As a quick check 4 + 3 + 2 = 9 = 6 + 3 + 0, so the 
sum of the eigenvalues is indeed equal to the sum 
of the diagonal entries.) 
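
Both the trace check and the eigenvector condition Av = λv are easy to confirm with exact integer arithmetic. In this sketch the helper names `apply` and `is_eigenpair` are hypothetical, and one representative eigenvector (k = 1) is taken for each eigenvalue.

```python
A = [[4, 2, 0],
     [2, 3, 2],
     [0, 2, 2]]

# One eigenvector (k = 1) for each eigenvalue of A.
pairs = {6: (2, 2, 1), 3: (-2, 1, 2), 0: (1, -2, 2)}


def apply(M, v):
    """Return the matrix-vector product Mv as a tuple."""
    return tuple(sum(m * x for m, x in zip(row, v)) for row in M)


def is_eigenpair(M, lam, v):
    """True if v is non-zero and Mv = lam * v."""
    return any(v) and apply(M, v) == tuple(lam * x for x in v)


print(all(is_eigenpair(A, lam, v) for lam, v in pairs.items()))  # True

trace = sum(A[i][i] for i in range(3))
print(trace == sum(pairs))  # True: 4 + 3 + 2 = 6 + 3 + 0
```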


Next we find the eigenvectors of A.

The eigenvector equations are
\[ \begin{aligned} (4 - \lambda)x + 2y &= 0 \\ 2x + (3 - \lambda)y + 2z &= 0 \\ 2y + (2 - \lambda)z &= 0. \end{aligned} \]

$\lambda = 6$: The eigenvector equations become
\[ \begin{aligned} -2x + 2y &= 0 \\ 2x - 3y + 2z &= 0 \\ 2y - 4z &= 0. \end{aligned} \]
The first and third equations imply that
x = y and y = 2z, so x = 2z. These satisfy
the second equation. Thus the eigenvectors
corresponding to the eigenvalue $\lambda = 6$ are the
non-zero vectors (x, y, z) satisfying y = 2z
and x = 2z; that is, the vectors of the form
(2k, 2k, k), where k ≠ 0.

$\lambda = 3$: The eigenvector equations become
\[ \begin{aligned} x + 2y &= 0 \\ 2x + 2z &= 0 \\ 2y - z &= 0. \end{aligned} \]
The first and second equations imply that
x = −2y and z = −x, so z = 2y. These
satisfy the third equation. Thus the
eigenvectors corresponding to the eigenvalue
$\lambda = 3$ are the non-zero vectors (x, y, z)
satisfying x = −2y and z = 2y; that is, the
vectors of the form
(−2k, k, 2k), where k ≠ 0.

$\lambda = 0$: The eigenvector equations become
\[ \begin{aligned} 4x + 2y &= 0 \\ 2x + 3y + 2z &= 0 \\ 2y + 2z &= 0. \end{aligned} \]
The first and third equations imply that
y = −2x and z = −y, so z = 2x. These
satisfy the second equation. Thus the
eigenvectors corresponding to the eigenvalue
$\lambda = 0$ are the non-zero vectors (x, y, z)
satisfying y = −2x and z = 2x; that is, the
vectors of the form
(k, −2k, 2k), where k ≠ 0.

Thus the eigenvectors of t are the non-zero vectors
of the following forms:
(2k, 2k, k), corresponding to $\lambda = 6$,
(−2k, k, 2k), corresponding to $\lambda = 3$,
(k, −2k, 2k), corresponding to $\lambda = 0$.


Solution to Exercise C121 
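(a) Let
\[ A = \begin{pmatrix} 1 & 2 \\ 0 & 6 \end{pmatrix}. \]

The characteristic equation is det(A − λI) = 0;
that is,
\[ \begin{vmatrix} 1 - \lambda & 2 \\ 0 & 6 - \lambda \end{vmatrix} = 0. \]

We expand the determinant and obtain
\[ (1 - \lambda)(6 - \lambda) - 0 = 0. \]

The eigenvalues of A are therefore $\lambda = 1$ and
$\lambda = 6$. Notice that these are the diagonal entries of
the upper triangular matrix A.

(b) Let
\[ A = \begin{pmatrix} 8 & 0 & 0 \\ 0 & -5 & 0 \\ 0 & 0 & 21 \end{pmatrix}. \]

The characteristic equation is det(A − λI) = 0;
that is,
\[ \begin{vmatrix} 8 - \lambda & 0 & 0 \\ 0 & -5 - \lambda & 0 \\ 0 & 0 & 21 - \lambda \end{vmatrix} = 0. \]

We expand the determinant and obtain
\[ (8 - \lambda) \begin{vmatrix} -5 - \lambda & 0 \\ 0 & 21 - \lambda \end{vmatrix} = 0. \]

Simplifying this expression, we obtain
\[ (8 - \lambda)\bigl((-5 - \lambda)(21 - \lambda) - 0\bigr) = 0. \]

The eigenvalues of A are therefore $\lambda = 8$, $\lambda = -5$
and $\lambda = 21$. Again, these are the diagonal entries
of the diagonal matrix A.

(c) Let
\[ A = \begin{pmatrix} 4 & 0 & 0 \\ 25 & -2 & 0 \\ 17 & \pi & 6 \end{pmatrix}. \]

The characteristic equation is det(A − λI) = 0;
that is,
\[ \begin{vmatrix} 4 - \lambda & 0 & 0 \\ 25 & -2 - \lambda & 0 \\ 17 & \pi & 6 - \lambda \end{vmatrix} = 0. \]

We expand the determinant and obtain
\[ (4 - \lambda) \begin{vmatrix} -2 - \lambda & 0 \\ \pi & 6 - \lambda \end{vmatrix} = 0. \]
Simplifying this expression, we obtain
\[ (4 - \lambda)\bigl((-2 - \lambda)(6 - \lambda) - 0\bigr) = 0. \]

The eigenvalues of A are therefore $\lambda = 4$, $\lambda = -2$
and $\lambda = 6$. Again, these are the diagonal entries of
the lower triangular matrix A.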


Solution to Exercise C122 


$\lambda = 6$: The non-zero vectors of the form
(2k, 2k, k) are the eigenvectors of t
corresponding to $\lambda = 6$. The eigenspace S(6)
is therefore the set of vectors
\[ \{(2k, 2k, k) : k \in \mathbb{R}\}. \]
Any vector in S(6) can be written as
k(2, 2, 1), so {(2, 2, 1)} is a basis for S(6).
Thus S(6) has dimension 1.

$\lambda = 3$: The non-zero vectors of the form
(−2k, k, 2k) are the eigenvectors of t
corresponding to $\lambda = 3$. The eigenspace S(3)
is therefore the set of vectors
\[ \{(-2k, k, 2k) : k \in \mathbb{R}\}. \]
Any vector in S(3) can be written as
k(−2, 1, 2), so {(−2, 1, 2)} is a basis for S(3).
Thus S(3) has dimension 1.


Solution to Exercise C123 


The matrix
\[ A = \begin{pmatrix} 1 & 1 & -1 \\ 0 & 4 & 0 \\ 0 & 0 & 4 \end{pmatrix} \]
is triangular, so the eigenvalues are the diagonal
entries $\lambda = 1$, $\lambda = 4$ and $\lambda = 4$.
The eigenvector equations are
\[ \begin{aligned} (1 - \lambda)x + y - z &= 0 \\ (4 - \lambda)y &= 0 \\ (4 - \lambda)z &= 0. \end{aligned} \]

$\lambda = 1$: The eigenvalue $\lambda = 1$ has multiplicity 1.

The eigenvector equations become
\[ \begin{aligned} y - z &= 0 \\ 3y &= 0 \\ 3z &= 0. \end{aligned} \]

The second and third equations give y = 0
and z = 0, respectively, which satisfy the first
equation. (They give no constraint on x.)

Thus the eigenvectors corresponding to the
eigenvalue $\lambda = 1$ are the vectors of the form
(k, 0, 0), where k ≠ 0.

The eigenspace S(1) is the set of vectors
\[ \{(k, 0, 0) : k \in \mathbb{R}\}. \]

Any vector in S(1) can be written as
k(1, 0, 0), so
{(1, 0, 0)}
is a basis for S(1).
Thus S(1) has dimension 1.
(Geometrically, S(1) is the x-axis.)

$\lambda = 4$: The eigenvalue $\lambda = 4$ has multiplicity 2.

The eigenvector equations become
\[ \begin{aligned} -3x + y - z &= 0 \\ 0 &= 0 \\ 0 &= 0. \end{aligned} \]

The first equation gives z = y − 3x and the
second and third give no constraints on y
and z.

Thus the eigenvectors corresponding to the
eigenvalue $\lambda = 4$ are the vectors of the form
(k, l, l − 3k), where k and l are not both 0.

The eigenspace S(4) is the set of vectors
\[ \{(k, l, l - 3k) : k, l \in \mathbb{R}\}. \]

Any vector in S(4) can be written as
k(1, 0, −3) + l(0, 1, 1), so
{(1, 0, −3), (0, 1, 1)}
is a basis for S(4).
Thus S(4) has dimension 2.

(Geometrically, S(4) is the plane in R³ with equation
−3x + y − z = 0.)

An alternative solution comes from using the
equivalent equation $x = \tfrac13(y - z)$, and has


basis 
ann nz 
Solution to Exercise C124 


The matrix
\[ A = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix} \]
is triangular, so the eigenvalues are the diagonal
entries $\lambda = 1$ and $\lambda = 1$.
The eigenvector equations are
\[ \begin{aligned} (1 - \lambda)x + y &= 0 \\ (1 - \lambda)y &= 0. \end{aligned} \]
$\lambda = 1$: The eigenvalue $\lambda = 1$ has multiplicity 2.
The eigenvector equations become
\[ \begin{aligned} 0x + y &= 0 \\ 0y &= 0. \end{aligned} \]

Thus y = 0 and there are no constraints
on x. Thus the eigenvectors corresponding to
the eigenvalue $\lambda = 1$ are the vectors of the
form (k, 0), where k ≠ 0.

The eigenspace S(1) is the set of vectors
\[ \{(k, 0) : k \in \mathbb{R}\}. \]

Any vector in S(1) can be written as k(1, 0),
so
{(1, 0)}
is a basis for S(1).
Thus S(1) has dimension 1.
(Geometrically, S(1) is the x-axis in R².)




Solution to Exercise C125 
Let
\[ A = \begin{pmatrix} 1 & -1 & 0 \\ 1 & 4 & 1 \\ -1 & 1 & 4 \end{pmatrix}. \]
The characteristic equation is det(A − λI) = 0;
that is,
\[ \begin{vmatrix} 1 - \lambda & -1 & 0 \\ 1 & 4 - \lambda & 1 \\ -1 & 1 & 4 - \lambda \end{vmatrix} = 0. \]

We expand the determinant and obtain
\[ (1 - \lambda) \begin{vmatrix} 4 - \lambda & 1 \\ 1 & 4 - \lambda \end{vmatrix} + \begin{vmatrix} 1 & 1 \\ -1 & 4 - \lambda \end{vmatrix} = 0. \]
This simplifies to
\[ (1 - \lambda)\bigl((4 - \lambda)^2 - 1\bigr) + \bigl((4 - \lambda) + 1\bigr) = 0. \]

Using the relation x² − 1 = (x − 1)(x + 1), where
x = 4 − λ, this simplifies further to
\[ (1 - \lambda)(3 - \lambda)(5 - \lambda) + (5 - \lambda) = 0, \]
and thus
\[ (5 - \lambda)\bigl((1 - \lambda)(3 - \lambda) + 1\bigr) = (5 - \lambda)(\lambda^2 - 4\lambda + 4) = (5 - \lambda)(\lambda - 2)^2 = 0. \]

The eigenvalues of A are $\lambda = 5$, $\lambda = 2$ and $\lambda = 2$.

(As a quick check 1 + 4 + 4 = 9 = 5 + 2 + 2, so the
sum of the eigenvalues is indeed equal to the sum
of the diagonal entries.)

The eigenvector equations are
\[ \begin{aligned} (1 - \lambda)x - y &= 0 \\ x + (4 - \lambda)y + z &= 0 \\ -x + y + (4 - \lambda)z &= 0. \end{aligned} \]

$\lambda = 5$: The eigenvalue $\lambda = 5$ has multiplicity 1.

The eigenvector equations become
\[ \begin{aligned} -4x - y &= 0 \\ x - y + z &= 0 \\ -x + y - z &= 0. \end{aligned} \]

The first equation gives y = −4x and
substituting this into the second gives
5x + z = 0, which implies that z = −5x. The
third equation is equivalent to the second.

Thus the eigenvectors corresponding to the
eigenvalue $\lambda = 5$ are the vectors of the form
(k, −4k, −5k), where k ≠ 0.

The eigenspace S(5) is the set of vectors
\[ \{(k, -4k, -5k) : k \in \mathbb{R}\}. \]
Any vector in S(5) can be written as
k(1, −4, −5), so
{(1, −4, −5)}
is a basis for S(5).
Thus S(5) has dimension 1.

$\lambda = 2$: The eigenvalue $\lambda = 2$ has multiplicity 2.
The eigenvector equations become
\[ \begin{aligned} -x - y &= 0 \\ x + 2y + z &= 0 \\ -x + y + 2z &= 0. \end{aligned} \]

The first equation gives y = −x and
substituting this into the second gives
−x + z = 0, which implies that z = x. These
satisfy the third equation.

Thus the eigenvectors corresponding to the
eigenvalue $\lambda = 2$ are the vectors of the form
(k, −k, k), where k ≠ 0.

The eigenspace S(2) is the set of vectors
\[ \{(k, -k, k) : k \in \mathbb{R}\}. \]

Any vector in S(2) can be written as
k(1, −1, 1), so
{(1, −1, 1)}
is a basis for S(2).
Thus S(2) has dimension 1.
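
The dimension of an eigenspace S(λ) is n minus the rank of A − λI, so it can be computed mechanically by row reduction. The sketch below uses exact fractions; the helper names `rank` and `eigenspace_dimension` are hypothetical, and the second matrix B is the triangular matrix with rows (1, 1, −1), (0, 4, 0), (0, 0, 4), chosen here as an example where the dimension does equal the multiplicity.

```python
from fractions import Fraction


def rank(M):
    """Rank of a matrix by Gaussian elimination over exact fractions."""
    rows = [[Fraction(x) for x in row] for row in M]
    r = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows))
                      if rows[i][col] != 0), None)
        if pivot is None:
            continue                      # no pivot in this column
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
        if r == len(rows):
            break
    return r


def eigenspace_dimension(A, lam):
    """Dimension of S(lam) = n - rank(A - lam*I)."""
    n = len(A)
    shifted = [[A[i][j] - (lam if i == j else 0) for j in range(n)]
               for i in range(n)]
    return n - rank(shifted)


A = [[1, -1, 0], [1, 4, 1], [-1, 1, 4]]
print(eigenspace_dimension(A, 5), eigenspace_dimension(A, 2))  # 1 1

B = [[1, 1, -1], [0, 4, 0], [0, 0, 4]]
print(eigenspace_dimension(B, 4))  # 2
```

For A the eigenvalue 2 has multiplicity 2 but a one-dimensional eigenspace, so A has no eigenvector basis; for B the eigenvalue 4 has multiplicity 2 and a two-dimensional eigenspace.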


Solution to Exercise C126 


Letting k = 1, we see that (−2, 1) and (1, 2) are
eigenvectors of t. Since (1, 2) is not a multiple of
(−2, 1), these two eigenvectors form a basis for R².


Solution to Exercise C127 


Each of the vectors in E is an eigenvector of t:
t(0, 1, −1) = (0, 0, 0) = 0(0, 1, −1),
t(−2, 1, 0) = (4, −2, 0) = −2(−2, 1, 0),
t(1, 0, −1) = (−3, 0, 3) = −3(1, 0, −1).

Thus E is a basis for R³ consisting of eigenvectors
of t; that is, E is an eigenvector basis of t.


Solution to Exercise C128 


(a) The matrix of t with respect to the standard
basis for R² is
\[ \begin{pmatrix} 1 & -2 \\ -2 & -2 \end{pmatrix}. \]
(b) Following Strategy C19, first we find the
images of the vectors in the basis
E = {(−2, 1), (1, 2)}:
t(−2, 1) = (−4, 2), t(1, 2) = (−3, −6).
Next we find the E-coordinates of each of these
image vectors:
\[ \begin{aligned} (-4, 2) &= 2(-2, 1) + 0(1, 2) = (2, 0)_E, \\ (-3, -6) &= 0(-2, 1) - 3(1, 2) = (0, -3)_E. \end{aligned} \]
Therefore $t(-2, 1) = (2, 0)_E$ and $t(1, 2) = (0, -3)_E$.
So the matrix of t with respect to the eigenvector
basis E is
\[ \begin{pmatrix} 2 & 0 \\ 0 & -3 \end{pmatrix}. \]

Solution to Exercise C129 
In Exercise C127 you showed that
t(0, 1, −1) = 0(0, 1, −1),
t(−2, 1, 0) = −2(−2, 1, 0),
t(1, 0, −1) = −3(1, 0, −1).

So the eigenvalues of t are $\lambda_1 = 0$, $\lambda_2 = -2$ and
$\lambda_3 = -3$, and, by Theorem C59, the matrix of t
with respect to E is
\[ \begin{pmatrix} 0 & 0 & 0 \\ 0 & -2 & 0 \\ 0 & 0 & -3 \end{pmatrix}. \]


Solution to Exercise C130 


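(a) \( P = \begin{pmatrix} -2 & 1 \\ 1 & 2 \end{pmatrix} \)

(b) \( P = \begin{pmatrix} 0 & -2 & 1 \\ 1 & 1 & 0 \\ -1 & 0 & -1 \end{pmatrix} \)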



Solution to Exercise C131

Let t : R² → R² be the linear transformation
given by
t(x, y) = (x − 2y, −2x − 2y)
and let E be the eigenvector basis {(−2, 1), (1, 2)}
of t. It follows from Exercise C128 that A is the
matrix of t with respect to the standard basis for
R² and D is the matrix of t with respect to the
eigenvector basis E. By Theorem C62,
D = P⁻¹AP, where P is the transition matrix
from E to the standard basis for R²; that is,
\[ P = \begin{pmatrix} -2 & 1 \\ 1 & 2 \end{pmatrix}. \]


Solution to Exercise C132

(a) \( D^5 = \begin{pmatrix} 2^5 & 0 \\ 0 & (-3)^5 \end{pmatrix} = \begin{pmatrix} 32 & 0 \\ 0 & -243 \end{pmatrix} \)

(b) We have A⁵ = PD⁵P⁻¹, where D is as in
part (a) and
\[ P = \begin{pmatrix} -2 & 1 \\ 1 & 2 \end{pmatrix}. \]
Since \( P^{-1} = \tfrac15 \begin{pmatrix} -2 & 1 \\ 1 & 2 \end{pmatrix} \), it follows that
\[ A^5 = \tfrac15 \begin{pmatrix} -2 & 1 \\ 1 & 2 \end{pmatrix} \begin{pmatrix} 32 & 0 \\ 0 & -243 \end{pmatrix} \begin{pmatrix} -2 & 1 \\ 1 & 2 \end{pmatrix} = \begin{pmatrix} -23 & -110 \\ -110 & -188 \end{pmatrix}. \]


Solution to Exercise C133

(There are many solutions possible for this and for
each of the remaining exercises in this section, each
corresponding to a different ordering of the
eigenvalues or a different choice of eigenvectors; in
each case the matrix P should correspond to the
matrix D so that P⁻¹AP = D.)

We use Strategy C20.
The eigenvalues of A are $\lambda = 6$, $\lambda = 3$ and $\lambda = 0$.

The eigenvectors of A are the non-zero vectors of
the following forms:
(2k, 2k, k), corresponding to $\lambda = 6$,
(−2k, k, 2k), corresponding to $\lambda = 3$,
(k, −2k, 2k), corresponding to $\lambda = 0$.

It follows from Theorem C63 that we can form an
eigenvector basis of A by taking one eigenvector
corresponding to each of the three distinct
eigenvalues. For example,
E = {(2, 2, 1), (−2, 1, 2), (1, −2, 2)}
is an eigenvector basis of A.

We use the eigenvectors in E to form the columns
of the transition matrix:
\[ P = \begin{pmatrix} 2 & -2 & 1 \\ 2 & 1 & -2 \\ 1 & 2 & 2 \end{pmatrix}. \]

We use the eigenvalues corresponding to the
eigenvectors in E to form the diagonal matrix:
\[ P^{-1}AP = D = \begin{pmatrix} 6 & 0 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 0 \end{pmatrix}. \]


Solution to Exercise C134

The characteristic equation of A is
\[ \begin{vmatrix} 1 - \lambda & 0 & 0 \\ 0 & 2 - \lambda & 1 \\ 0 & 1 & 2 - \lambda \end{vmatrix} = 0. \]

We expand the determinant and obtain
\[ (1 - \lambda)\bigl((2 - \lambda)^2 - 1\bigr) = 0, \]
which simplifies to
\[ (1 - \lambda)(\lambda^2 - 4\lambda + 3) = (1 - \lambda)(\lambda - 1)(\lambda - 3) = 0. \]

The eigenvalues of A are therefore $\lambda = 3$, $\lambda = 1$
and $\lambda = 1$.

To find the eigenspaces of A, we consider the
eigenvector equations
\[ \begin{aligned} (1 - \lambda)x &= 0 \\ (2 - \lambda)y + z &= 0 \\ y + (2 - \lambda)z &= 0, \end{aligned} \]
for each of the eigenvalues.




$\lambda = 3$: The eigenvector equations become
\[ \begin{aligned} -2x &= 0 \\ -y + z &= 0 \\ y - z &= 0. \end{aligned} \]

So x = 0, y = z.
Thus S(3) = {(0, k, k) : k ∈ R}.

$\lambda = 1$: The eigenvector equations become
\[ \begin{aligned} 0x &= 0 \\ y + z &= 0 \\ y + z &= 0. \end{aligned} \]
So z = −y and there are no constraints on x.

Thus S(1) = {(k, l, −l) : k, l ∈ R}.
A basis for S(3) is {(0, 1, 1)} and a basis for S(1) is
{(1, 0, 0), (0, 1, −1)} because any vector in S(1) can
be written as k(1, 0, 0) + l(0, 1, −1). The set
E = {(0, 1, 1), (1, 0, 0), (0, 1, −1)}
contains three vectors, so it is an eigenvector basis
of A.

We use the eigenvectors in E to form the columns
of the transition matrix:
\[ P = \begin{pmatrix} 0 & 1 & 0 \\ 1 & 0 & 1 \\ 1 & 0 & -1 \end{pmatrix}. \]

We use the eigenvalues corresponding to the
eigenvectors in E to form the diagonal matrix:
\[ P^{-1}AP = D = \begin{pmatrix} 3 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}. \]
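
A diagonalisation can be checked without computing an inverse: when P is invertible, P⁻¹AP = D is equivalent to AP = PD. The sketch below confirms this for the matrices of this solution, with A read off from the characteristic equation; the helpers `matmul` and `det3` are hypothetical names introduced for illustration.

```python
def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def det3(M):
    """Determinant of a 3 x 3 matrix by expansion along the first row."""
    return (M[0][0] * (M[1][1] * M[2][2] - M[1][2] * M[2][1])
            - M[0][1] * (M[1][0] * M[2][2] - M[1][2] * M[2][0])
            + M[0][2] * (M[1][0] * M[2][1] - M[1][1] * M[2][0]))


A = [[1, 0, 0], [0, 2, 1], [0, 1, 2]]
P = [[0, 1, 0], [1, 0, 1], [1, 0, -1]]   # columns are the vectors of E
D = [[3, 0, 0], [0, 1, 0], [0, 0, 1]]

print(det3(P) != 0)                   # True: P is invertible
print(matmul(A, P) == matmul(P, D))   # True: AP = PD
```

Reordering the columns of P while reordering the diagonal entries of D in the same way leaves both checks true, which is why many correct answers are possible.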


Solution to Exercise C135 


(a) We have
\[ \begin{aligned} (2k, 2k, k) \cdot (-2l, l, 2l) &= -4kl + 2kl + 2kl = 0, \\ (2k, 2k, k) \cdot (m, -2m, 2m) &= 2km - 4km + 2km = 0, \\ (-2l, l, 2l) \cdot (m, -2m, 2m) &= -2lm - 2lm + 4lm = 0. \end{aligned} \]

Thus the given vectors form an orthogonal set.
Since there are three of them, they form an
orthogonal basis for R³.

(b) We have
\[ \begin{aligned} |v_1| &= |(2k, 2k, k)| = \sqrt{4k^2 + 4k^2 + k^2} = \sqrt{9k^2} = 3|k|, \\ |v_2| &= |(-2l, l, 2l)| = \sqrt{4l^2 + l^2 + 4l^2} = \sqrt{9l^2} = 3|l|, \\ |v_3| &= |(m, -2m, 2m)| = \sqrt{m^2 + 4m^2 + 4m^2} = \sqrt{9m^2} = 3|m|. \end{aligned} \]
Thus $|v_1| = |v_2| = |v_3| = 1$ if
\[ k = l = m = \tfrac13. \]

Solution to Exercise C136 
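We calculate PᵀP.
\[ P^T P = \begin{pmatrix} \frac23 & \frac23 & \frac13 \\ -\frac23 & \frac13 & \frac23 \\ \frac13 & -\frac23 & \frac23 \end{pmatrix} \begin{pmatrix} \frac23 & -\frac23 & \frac13 \\ \frac23 & \frac13 & -\frac23 \\ \frac13 & \frac23 & \frac23 \end{pmatrix} = \begin{pmatrix} \frac99 & 0 & 0 \\ 0 & \frac99 & 0 \\ 0 & 0 & \frac99 \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} = I. \]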
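
The condition PᵀP = I says exactly that the columns of P are orthonormal, so it can be tested entry by entry. The sketch below does this with exact fractions; `is_orthogonal` is a hypothetical helper, and P is taken to have columns (2, 2, 1)/3, (−2, 1, 2)/3 and (1, −2, 2)/3, an assumption consistent with the eigenvectors of Exercise C135.

```python
from fractions import Fraction


def is_orthogonal(P):
    """True if P^T P = I, i.e. the columns of P are orthonormal."""
    n = len(P)
    return all(
        sum(P[k][i] * P[k][j] for k in range(n)) == (1 if i == j else 0)
        for i in range(n) for j in range(n)
    )


third = Fraction(1, 3)
P = [[2 * third, -2 * third, third],
     [2 * third, third, -2 * third],
     [third, 2 * third, 2 * third]]

print(is_orthogonal(P))  # True
```

Exact fractions avoid the rounding issues that a floating-point comparison with the identity matrix would raise.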
Solution to Exercise C137 


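(a) We use Strategy C22.
The characteristic equation of A is
\[ \begin{vmatrix} 9 - \lambda & -2 \\ -2 & 6 - \lambda \end{vmatrix} = 0. \]

We expand the determinant and obtain
\[ (9 - \lambda)(6 - \lambda) - 4 = 0, \]
which simplifies to
\[ \lambda^2 - 15\lambda + 50 = (\lambda - 10)(\lambda - 5) = 0. \]

The eigenvalues of A are therefore $\lambda = 10$ and
$\lambda = 5$.

Next we find orthonormal bases for the
eigenspaces.

The eigenvector equations are
\[ \begin{aligned} (9 - \lambda)x - 2y &= 0 \\ -2x + (6 - \lambda)y &= 0. \end{aligned} \]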



A=10| The eigenvector equations become 
=g —2y=0 
—2x¢ —4y=0. 
These equations are equivalent to the single 
equation 
x+2y=0, 


that is, x = —2y. Thus the eigenvectors 
corresponding to A = 10 are the non-zero 
vectors of the form (—2k, k). 


An eigenvector of magnitude 1 corresponding 
to A = 10 is 


(34) 


λ = 5  The eigenvector equations become
4x - 2y = 0
-2x + y = 0.
These equations are equivalent to the single
equation
2x - y = 0,

that is, y = 2x. Thus the eigenvectors
corresponding to λ = 5 are the non-zero
vectors of the form (k, 2k).
An eigenvector of magnitude 1 corresponding
to λ = 5 is
(1/√5, 2/√5).
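It follows from Theorem C64 that an orthonormal
eigenvector basis of A is
E = {(-2/√5, 1/√5), (1/√5, 2/√5)}.

We use the eigenvectors in E to form the columns
of the transition matrix:
\[
P = \begin{pmatrix} -\frac{2}{\sqrt5} & \frac{1}{\sqrt5} \\[2pt] \frac{1}{\sqrt5} & \frac{2}{\sqrt5} \end{pmatrix}.
\]
We use the eigenvalues corresponding to the
eigenvectors in E to form the diagonal matrix:
\[
P^TAP = D = \begin{pmatrix} 10 & 0 \\ 0 & 5 \end{pmatrix}.
\]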
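The orthogonal diagonalisation in part (a) can be checked numerically. This is a small sketch, assuming A = (9 -2; -2 6) as read off from the eigenvector equations and the unit eigenvectors (-2/√5, 1/√5) and (1/√5, 2/√5) as the columns of P; the helper names `transpose` and `matmul` are illustrative.

```python
import math

# Matrix A assumed from the eigenvector equations of part (a).
A = [[9, -2], [-2, 6]]
s = 1 / math.sqrt(5)
# Columns of P: unit eigenvectors for eigenvalues 10 and 5, in that order.
P = [[-2 * s, 1 * s],
     [1 * s, 2 * s]]

def transpose(M):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(row) for row in zip(*M)]

def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)]
            for row in X]

D = matmul(matmul(transpose(P), A), P)
expected = [[10, 0], [0, 5]]
# Largest entrywise difference between P^T A P and diag(10, 5).
max_error = max(abs(D[i][j] - expected[i][j])
                for i in range(2) for j in range(2))
print(max_error)
```

The entries agree with diag(10, 5) up to floating-point rounding, which is why the comparison uses a tolerance rather than exact equality.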


(b) The eigenvalues of A are given as λ = 6, λ = 3
and λ = 2.




Now we find an orthonormal eigenvector basis 
of A. 


The eigenvector equations are 


(5 - λ)x - y - z = 0
-x + (3 - λ)y + z = 0
-x + y + (3 - λ)z = 0.

λ = 6  The eigenvector equations become
-x - y - z = 0
-x - 3y + z = 0
-x + y - 3z = 0.
Adding the first and second equations
together, we obtain
-2x - 4y = 0,
so x = -2y. Substituting this into the third
equation, we obtain
3y - 3z = 0,
so z = y. Thus the eigenvectors corresponding
to λ = 6 are the non-zero vectors of the form
(-2k, k, k).
An eigenvector of magnitude 1 corresponding
to λ = 6 is
(-2/√6, 1/√6, 1/√6).


λ = 3  The eigenvector equations become
2x - y - z = 0
-x + z = 0
-x + y = 0.

The second and third equations imply that
z = x and y = x. These satisfy the first
equation. Thus the eigenvectors
corresponding to λ = 3 are the non-zero
vectors of the form (k, k, k).

An eigenvector of magnitude 1 corresponding
to λ = 3 is
(1/√3, 1/√3, 1/√3).


λ = 2  The eigenvector equations become
3x - y - z = 0
-x + y + z = 0
-x + y + z = 0.

Adding the first and second equations
together, we obtain
2x = 0,

which implies that x = 0. Substituting this
into the third equation, we obtain
y + z = 0,

which implies that z = -y. Thus the
eigenvectors corresponding to λ = 2 are the
non-zero vectors of the form (0, k, -k).

An eigenvector of magnitude 1 corresponding
to λ = 2 is
(0, 1/√2, -1/√2).


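It follows from Theorem C64 that an orthonormal
eigenvector basis of A is
E = {(-2/√6, 1/√6, 1/√6), (1/√3, 1/√3, 1/√3), (0, 1/√2, -1/√2)}.

We use the eigenvectors in E to form the columns
of the transition matrix:
\[
P = \begin{pmatrix}
-\frac{2}{\sqrt6} & \frac{1}{\sqrt3} & 0 \\[2pt]
\frac{1}{\sqrt6} & \frac{1}{\sqrt3} & \frac{1}{\sqrt2} \\[2pt]
\frac{1}{\sqrt6} & \frac{1}{\sqrt3} & -\frac{1}{\sqrt2}
\end{pmatrix}.
\]

We use the eigenvalues corresponding to the
eigenvectors in E to form the diagonal matrix:
\[
P^TAP = D = \begin{pmatrix} 6 & 0 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 2 \end{pmatrix}.
\]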


Solution to Exercise C138 
We use Strategy C22. 


A basis for the eigenspace S(3) is {(0,1,1)}, so an
orthonormal basis for S(3) is
{(0, 1/√2, 1/√2)}.


A basis for the eigenspace S(1) is 
{(1,0,0), (0,1,—1)}. 


These two basis vectors are orthogonal since 


(1,0,0) · (0,1,-1) = 0.




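An orthonormal basis for S(1) is therefore
{(1, 0, 0), (0, 1/√2, -1/√2)}.

By Theorem C64 an orthonormal eigenvector basis
of A is therefore
E = {(0, 1/√2, 1/√2), (1, 0, 0), (0, 1/√2, -1/√2)}.

We use the eigenvectors in E to form the columns
of the transition matrix:
\[
P = \begin{pmatrix}
0 & 1 & 0 \\[2pt]
\frac{1}{\sqrt2} & 0 & \frac{1}{\sqrt2} \\[2pt]
\frac{1}{\sqrt2} & 0 & -\frac{1}{\sqrt2}
\end{pmatrix}.
\]

We use the eigenvalues corresponding to the
eigenvectors in E to form the diagonal matrix:
\[
P^TAP = D = \begin{pmatrix} 3 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}.
\]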


Solution to Exercise C139 


By Theorem C65, to prove that the product PQ is
orthogonal it is sufficient to show that
\[
(PQ)^T = (PQ)^{-1}.
\]
But, since P and Q are orthogonal,
\[
(PQ)^T = Q^TP^T = Q^{-1}P^{-1} = (PQ)^{-1}.
\]
Solution to Exercise C140 


(a) To verify that A is orthogonal, it is sufficient
to show that AᵀA = I, by Theorem C65. We have
\[
A^TA =
\begin{pmatrix} 0 & 0 & 1 \\ 0 & 1 & 0 \\ -1 & 0 & 0 \end{pmatrix}
\begin{pmatrix} 0 & 0 & -1 \\ 0 & 1 & 0 \\ 1 & 0 & 0 \end{pmatrix}
= I,
\]
so A is orthogonal.

(Alternatively, we could have shown that the
vectors (0,0,1), (0,1,0) and (-1,0,0) form an
orthonormal basis for R³.)




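(b) We evaluate the determinant of A:
\[
\det A =
\begin{vmatrix} 0 & 0 & -1 \\ 0 & 1 & 0 \\ 1 & 0 & 0 \end{vmatrix}
= 0 - 0 - \begin{vmatrix} 0 & 1 \\ 1 & 0 \end{vmatrix}
= 1.
\]
Therefore A represents a rotation of R³.

Solution to Exercise C141

(a) The ellipse with equation
\[
\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1
\]
is written in matrix form as
\[
\mathbf{x}^T \begin{pmatrix} 1/a^2 & 0 \\ 0 & 1/b^2 \end{pmatrix} \mathbf{x}
+ \begin{pmatrix} 0 & 0 \end{pmatrix} \mathbf{x} - 1 = 0.
\]
So the ellipse in standard position has
\[
A = \begin{pmatrix} 1/a^2 & 0 \\ 0 & 1/b^2 \end{pmatrix}
\quad\text{and}\quad
J = \begin{pmatrix} 0 \\ 0 \end{pmatrix}.
\]

(b) The hyperbola with equation
\[
\frac{x^2}{a^2} - \frac{y^2}{b^2} = 1
\]
is written in matrix form as
\[
\mathbf{x}^T \begin{pmatrix} 1/a^2 & 0 \\ 0 & -1/b^2 \end{pmatrix} \mathbf{x}
+ \begin{pmatrix} 0 & 0 \end{pmatrix} \mathbf{x} - 1 = 0.
\]
So the hyperbola in standard position has
\[
A = \begin{pmatrix} 1/a^2 & 0 \\ 0 & -1/b^2 \end{pmatrix}
\quad\text{and}\quad
J = \begin{pmatrix} 0 \\ 0 \end{pmatrix}.
\]

(c) The parabola with equation
\[
y^2 = 4ax
\]
is written in matrix form as
\[
\mathbf{x}^T \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix} \mathbf{x}
+ \begin{pmatrix} -4a & 0 \end{pmatrix} \mathbf{x} + 0 = 0.
\]
So the parabola in standard position has
\[
A = \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}
\quad\text{and}\quad
J = \begin{pmatrix} -4a \\ 0 \end{pmatrix}.
\]

Solution to Exercise C142

By Theorem C64 an orthonormal eigenvector basis
of A for the eigenvalues λ = 2 and λ = -3, in that
order, is
E = {(-2/√5, 1/√5), (1/√5, 2/√5)}.

We use the eigenvectors in E to form the columns
of the transition matrix:
\[
P = \begin{pmatrix} -\frac{2}{\sqrt5} & \frac{1}{\sqrt5} \\[2pt] \frac{1}{\sqrt5} & \frac{2}{\sqrt5} \end{pmatrix}.
\]
We use the eigenvalues to form the diagonal matrix
\[
P^TAP = D = \begin{pmatrix} 2 & 0 \\ 0 & -3 \end{pmatrix}.
\]
It follows from equation (9) that the equation of
the conic is now
\[
\begin{pmatrix} x' & y' \end{pmatrix}
\begin{pmatrix} 2 & 0 \\ 0 & -3 \end{pmatrix}
\begin{pmatrix} x' \\ y' \end{pmatrix}
+ \begin{pmatrix} 6 & 12 \end{pmatrix}
\begin{pmatrix} -\frac{2}{\sqrt5} & \frac{1}{\sqrt5} \\[2pt] \frac{1}{\sqrt5} & \frac{2}{\sqrt5} \end{pmatrix}
\begin{pmatrix} x' \\ y' \end{pmatrix}
+ 21 = 0,
\]
that is,
2(x')² - 3(y')² + 6√5y' + 21 = 0.

Solution to Exercise C143

We have
2(x')² - 3(y')² + 6√5y' + 21 = 0,
which is equivalent to
2(x')² - 3((y')² - 2√5y') + 21 = 0.
Completing the square gives
2(x')² - 3(y' - √5)² + 15 + 21 = 0,
so
2(x')² - 3(y' - √5)² + 36 = 0.
We set the new coordinates to be
x'' = (x'', y'')ᵀ = (x', y' - √5)ᵀ,
so substitute x'' = x' and y'' = y' - √5.
The equation of the conic is now
2(x'')² - 3(y'')² = -36,
or
\[
\frac{(y'')^2}{12} - \frac{(x'')^2}{18} = 1.
\]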
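The completing-the-square step in Exercise C143 can be spot-checked numerically: the expressions before and after the translation should agree at every point. This is a minimal sketch; the function names `before` and `after` and the sample points are illustrative choices.

```python
import math

r5 = math.sqrt(5)

def before(x1, y1):
    """Left-hand side in the (x', y') coordinates."""
    return 2 * x1**2 - 3 * y1**2 + 6 * r5 * y1 + 21

def after(x2, y2):
    """Left-hand side in the translated (x'', y'') coordinates."""
    return 2 * x2**2 - 3 * y2**2 + 36

# Arbitrary sample points (x', y'); the translation is y'' = y' - sqrt(5).
samples = [(0.0, 0.0), (1.5, -2.0), (-3.0, 4.25)]
max_gap = max(abs(before(x, y) - after(x, y - r5)) for x, y in samples)
print(max_gap)
```

Agreement at a few points is not a proof, but it catches sign slips in the constant term (here -15 + 21 + ... versus +15 + 21) very quickly.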




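Solution to Exercise C144

1. Introduce matrices.

We have
\[
A = \begin{pmatrix} 9 & -2 \\ -2 & 6 \end{pmatrix}, \quad
J = \begin{pmatrix} -10 \\ -20 \end{pmatrix}, \quad
H = -5.
\]

2. Align the axes.

We have
\[
P^TAP = \begin{pmatrix} 10 & 0 \\ 0 & 5 \end{pmatrix},
\]
where
\[
P = \begin{pmatrix} -\frac{2}{\sqrt5} & \frac{1}{\sqrt5} \\[2pt] \frac{1}{\sqrt5} & \frac{2}{\sqrt5} \end{pmatrix}.
\]
So
\[
\begin{pmatrix} f & g \end{pmatrix}
= \begin{pmatrix} -10 & -20 \end{pmatrix} P
= \begin{pmatrix} 0 & -\frac{50}{\sqrt5} \end{pmatrix}
= \begin{pmatrix} 0 & -10\sqrt5 \end{pmatrix}.
\]
The equation of the conic is now
10(x')² + 5(y')² - 10√5y' - 5 = 0.
Dividing through by 5, we obtain
2(x')² + (y')² - 2√5y' - 1 = 0.

3. Translate the origin.

We write this equation as
2(x')² + ((y')² - 2√5y') - 1 = 0.

Completing the square in this equation, we
obtain
2(x')² + (y' - √5)² - 5 - 1 = 0.
Substituting x'' = x' and y'' = y' - √5 in this
equation and simplifying, we obtain
2(x'')² + (y'')² - 6 = 0.

The equation of the conic in standard form is
\[
\frac{(x'')^2}{3} + \frac{(y'')^2}{6} = 1.
\]
The conic is an ellipse.

Solution to Exercise C145

1. Introduce matrices.

We have
\[
A = \begin{pmatrix} 1 & -2 \\ -2 & 4 \end{pmatrix}, \quad
J = \begin{pmatrix} -6 \\ -8 \end{pmatrix}, \quad
H = 5.
\]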


2. Align the axes.

The characteristic equation of A is
\[
\begin{vmatrix} 1-\lambda & -2 \\ -2 & 4-\lambda \end{vmatrix} = 0.
\]

We expand the determinant and obtain
(1 - λ)(4 - λ) - 4 = 0,
which simplifies to
λ² - 5λ = λ(λ - 5) = 0.
The eigenvalues of A are 5 and 0.
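For a 2 × 2 matrix the characteristic equation is λ² - (trace)λ + (determinant) = 0, so the eigenvalues above can be recovered from the quadratic formula. This is a small sketch, assuming A = (1 -2; -2 4) as read off from the characteristic equation; the variable names are illustrative.

```python
import math

# Matrix A assumed from the characteristic equation above.
A = [[1, -2], [-2, 4]]

trace = A[0][0] + A[1][1]
det = A[0][0] * A[1][1] - A[0][1] * A[1][0]

# Roots of lambda^2 - trace*lambda + det = 0 via the quadratic formula.
disc = trace * trace - 4 * det
root = math.sqrt(disc)
eigenvalues = [(trace + root) / 2, (trace - root) / 2]

print(eigenvalues)  # [5.0, 0.0]
```

A zero determinant signals a zero eigenvalue, which is what makes this conic a candidate for a parabola.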
The eigenvector equations are 
(1 - λ)x - 2y = 0
-2x + (4 - λ)y = 0.

λ = 5  The eigenvector equations become
-4x - 2y = 0
-2x - y = 0.
These equations are equivalent to the
single equation
2x + y = 0,
which implies that y = -2x. Thus the
eigenvectors corresponding to λ = 5 are the
non-zero vectors of the form (k, -2k).

An orthonormal basis for S(5) is
{(1/√5, -2/√5)}.

λ = 0  The eigenvector equations become
x - 2y = 0
-2x + 4y = 0.
These equations are equivalent to the
single equation
x - 2y = 0,
which implies that x = 2y. Thus the
eigenvectors corresponding to λ = 0 are the
non-zero vectors of the form (2k, k).




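An orthonormal basis for S(0) is
{(2/√5, 1/√5)}.

By Theorem C64 an orthonormal eigenvector
basis of A is therefore
E = {(1/√5, -2/√5), (2/√5, 1/√5)}.

We use the eigenvectors in E to form the
columns of the transition matrix:
\[
P = \begin{pmatrix} \frac{1}{\sqrt5} & \frac{2}{\sqrt5} \\[2pt] -\frac{2}{\sqrt5} & \frac{1}{\sqrt5} \end{pmatrix},
\]
so
\[
\begin{pmatrix} f & g \end{pmatrix}
= \begin{pmatrix} -6 & -8 \end{pmatrix} P
= \begin{pmatrix} \frac{10}{\sqrt5} & -\frac{20}{\sqrt5} \end{pmatrix}
= \begin{pmatrix} 2\sqrt5 & -4\sqrt5 \end{pmatrix}.
\]
The equation of the conic is now
5(x')² + 2√5x' - 4√5y' + 5 = 0.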
3. Translate the origin. 


We rewrite this equation by taking out the
coefficient of the (x')² term to get
\[
5\left( (x')^2 + \frac{2}{\sqrt5}\,x' \right) - 4\sqrt5\,y' + 5 = 0.
\]
Completing the square in this equation, we
obtain
\[
5\left( x' + \frac{1}{\sqrt5} \right)^2 - 1 - 4\sqrt5\,y' + 5 = 0.
\]
We substitute
\[
x'' = x' + \frac{1}{\sqrt5}
\]
into this equation and rewrite it by taking out
the coefficient of the y' term to get
\[
5(x'')^2 - 4\sqrt5 \left( y' - \frac{1}{\sqrt5} \right) = 0.
\]




We substitute
\[
y'' = y' - \frac{1}{\sqrt5}
\]
to obtain
5(x'')² - 4√5y'' = 0.

The equation of the conic in standard form is
\[
(x'')^2 = \frac{4}{\sqrt5}\,y''.
\]
The conic is a parabola.
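The whole chain in Exercise C145 (rotation by P followed by translation) can be checked by mapping points of the standard-form parabola back to the original coordinates. The sketch below assumes the conic x² - 4xy + 4y² - 6x - 8y + 5 = 0, which corresponds to A = (1 -2; -2 4), J = (-6, -8)ᵀ and H = 5, together with P = (1/√5)(1 2; -2 1); these values and the function names are assumptions made for illustration.

```python
import math

r5 = math.sqrt(5)

def conic(x, y):
    # Left-hand side x^T A x + J^T x + H with A = [[1, -2], [-2, 4]],
    # J = (-6, -8) and H = 5 (values assumed from this solution).
    return x * x - 4 * x * y + 4 * y * y - 6 * x - 8 * y + 5

def to_original(x2, y2):
    # Undo the translation x'' = x' + 1/sqrt(5), y'' = y' - 1/sqrt(5) ...
    x1 = x2 - 1 / r5
    y1 = y2 + 1 / r5
    # ... then apply x = P x' with P = (1/sqrt(5)) [[1, 2], [-2, 1]].
    return (x1 + 2 * y1) / r5, (-2 * x1 + y1) / r5

# Points (t, sqrt(5) t^2 / 4) lie on the parabola (x'')^2 = (4/sqrt(5)) y''.
residuals = []
for t in (-2.0, -0.5, 0.0, 1.0, 3.0):
    x, y = to_original(t, r5 * t * t / 4)
    residuals.append(abs(conic(x, y)))

max_residual = max(residuals)
print(max_residual)
```

For t = 0 the vertex of the parabola maps back to (x, y) = (1/5, 3/5), which can be substituted into the original equation by hand as a further check.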


Solution to Exercise C146 


1. Introduce matrices. 


We have
\[
A = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}, \quad
J = \begin{pmatrix} -2 \\ 4 \\ -6 \end{pmatrix}.
\]


2. Align the axes. 


The matrix A is already in diagonal form. (The
axes of the quadric are parallel to the x-axis,
y-axis and z-axis of R³.)


3. Translate the origin. 


We write the equation as
(x² - 2x) + (y² + 4y) + (z² - 6z) - 11 = 0.

Completing the squares in this equation, we
obtain
(x - 1)² - 1 + (y + 2)² - 4 + (z - 3)² - 9 - 11 = 0.
Substituting
x' = x - 1, y' = y + 2 and z' = z - 3
in this equation and simplifying, we obtain
(x')² + (y')² + (z')² - 25 = 0.
The equation of the quadric in standard form is
\[
\frac{(x')^2}{25} + \frac{(y')^2}{25} + \frac{(z')^2}{25} = 1.
\]

This is the equation of an ellipsoid.

(This ellipsoid is in fact a sphere since
a = b = c = 5; all the curves of intersection are
circles.)
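The conclusion of Exercise C146 can be checked with integer arithmetic: the substitution places the centre at (1, -2, 3), and the standard form gives radius 5, so points at distance 5 from the centre should satisfy the original equation. This is a minimal sketch; the chosen offsets are arbitrary integer vectors of length 5.

```python
def quadric(x, y, z):
    """Left-hand side of the original equation of the quadric."""
    return x * x + y * y + z * z - 2 * x + 4 * y - 6 * z - 11

# Centre read off from x' = x - 1, y' = y + 2, z' = z - 3.
centre = (1, -2, 3)

# Integer offsets of length 5 (for example 3-4-5 triples).
offsets = [(5, 0, 0), (0, -5, 0), (0, 0, 5), (3, 4, 0), (0, -3, 4)]

values = [quadric(centre[0] + dx, centre[1] + dy, centre[2] + dz)
          for dx, dy, dz in offsets]
print(values)  # [0, 0, 0, 0, 0]
```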


Solution to Exercise C147 


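1. Introduce matrices.

We have
\[
A = \begin{pmatrix} 4 & 2 & 0 \\ 2 & 3 & 2 \\ 0 & 2 & 2 \end{pmatrix}, \quad
J = \begin{pmatrix} 12 \\ 0 \\ 12 \end{pmatrix}.
\]

2. Align the axes.

We have
\[
P^TAP = \begin{pmatrix} 6 & 0 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 0 \end{pmatrix},
\]
where
\[
P = \begin{pmatrix}
\tfrac23 & -\tfrac23 & \tfrac13 \\
\tfrac23 & \tfrac13 & -\tfrac23 \\
\tfrac13 & \tfrac23 & \tfrac23
\end{pmatrix}.
\]
So
\[
\begin{pmatrix} f & g & h \end{pmatrix}
= \begin{pmatrix} 12 & 0 & 12 \end{pmatrix}
\begin{pmatrix}
\tfrac23 & -\tfrac23 & \tfrac13 \\
\tfrac23 & \tfrac13 & -\tfrac23 \\
\tfrac13 & \tfrac23 & \tfrac23
\end{pmatrix}
= \begin{pmatrix} 12 & 0 & 12 \end{pmatrix}.
\]
The equation of the quadric is now
6(x')² + 3(y')² + 12x' + 12z' + 18 = 0.

3. Translate the origin.

We write this equation as
6((x')² + 2x') + 3(y')² + 12z' + 18 = 0.

Completing the square in this equation, we
obtain
6(x' + 1)² - 6 + 3(y')² + 12z' + 18 = 0.
Substituting
x'' = x' + 1, y'' = y' and z'' = z' + 1
in this equation and simplifying, we obtain
2(x'')² + (y'')² + 4z'' = 0.
The equation of the quadric in standard form is
\[
\frac{(x'')^2}{2} + \frac{(y'')^2}{4} = -z''.
\]

This is the equation of an elliptic paraboloid.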
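The "align the axes" step of Exercise C147 can be verified with exact rational arithmetic. The sketch below assumes the matrices A, J and P as they appear in this solution; the helper names `transpose` and `matmul` are illustrative.

```python
from fractions import Fraction as F

# Matrices assumed from this solution.
A = [[4, 2, 0],
     [2, 3, 2],
     [0, 2, 2]]
J = [12, 0, 12]
P = [[F(2, 3), F(-2, 3), F(1, 3)],
     [F(2, 3), F(1, 3), F(-2, 3)],
     [F(1, 3), F(2, 3), F(2, 3)]]

def transpose(M):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(row) for row in zip(*M)]

def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)]
            for row in X]

# Diagonal matrix P^T A P and the new linear coefficients J^T P.
D = matmul(matmul(transpose(P), A), P)
fgh = [sum(J[i] * P[i][j] for i in range(3)) for j in range(3)]

print(D, fgh)
```

The zero on the diagonal of D is what removes the (z')² term, leaving z' to appear only linearly and giving a paraboloid rather than an ellipsoid.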




